Europaisches Patentamt 
European Patent Office 
Office europ^en des brevets 



© Publication number: 



0 408178 A1 



© 



EUROPEAN PATENT APPLICATION 



© Application number: 90305415.3 
© Date of filing: 18.05.90 



© int. CIA C07K 13/00, C12N 15/31, 
C07K 3/20 



© Priority: 19.05.89 US 354264 

© Date of publication of application: 
16.01.91 Bulletin 91/03 

© Designated Contracting States: 

AT BE CH DE DK ES FR GB GR IT LI LU NL SE 



© Applicant: GENEX CORPORATION 
16020 Industrial Drive 
Gaithersburg Maryland 20877(US) 

© Inventor: Fahnestock, Stephen R. 
18818 Luray Court 
Olney, Maryland 20832(US) 
Inventor: Lee, Timothy 
9701 Fields Road, No. 406 
Gaithersburg, Maryland 20878(US) 
Inventor: Wroble, Marie H. 
13214 Gerlach Court 
Mt. Airy, Maryland 21 771 (US) 

© Representative: Sheard, Andrew Gregory et ai 
Kilburn & Strode 30, John Streeteet 
London WC1N2DD(GB) 



© Immobilized protein G variants and the use thereof. 

© Protein G variants and processes for their preparation by recombinant techniques are disclosed, as well as 
use of the same technology to make known Protein G. Such variants are useful for separating IgG and fragments 
thereof. 



00 

oo 



o. 

LU 



Xerox Copy Centre 



EP 0 408 178 Al 



IMMOBILIZED PROTEIN G VARIANTS AND THE USE THEREOF 



This present invention relates to immobilized Protein G variants and the use thereof. 

There has been growing interest in recent years in bacterial Ig (immunoglobulin) receptors, molecules 
that bind to antibodies through a nonimmune mechanism. This binding is not to the antigen recognition site, 
which is located in the variable portion of the antibody molecule, but to the constant portion of the antibody. 
5 The constant region is common to many types of antibodies, thus bacterial Ig receptors can bind to many 
types of antibodies. This property makes bacterial Ig receptors useful in a number of immunochemical 
applications. 

Bacterial ig receptors have a number of useful or potentially useful applications, primarily in the 
detection of antibodies, the purification of antibodies and the treatment of diseases. The detection of 

10 antibodies is required in several phases of laboratory research in immunology, including the screening of 
hybridoma clones for the secretion of specific monoclonal antibodies, the measurement of the immune 
response of an immunized animal, and the quantitation of antigens by competitive binding assays. Methods 
for detecting antibodies using bacterial Ig receptors have been found to be more sensitive and less prone to 
interference and high background signals than other detection methods [Boyle, M.D.P., Biotechniques 

15 2:334-340 (1984)]. 

Ig receptors also are useful in purifying antibodies to be used in the purification of protein drugs and as 
therapeutics. Although a number of methods are known, a popular method involves the use of affinity 
chromatography on columns of immobilized bacterial Ig receptors. This method is preferred because the 
columns can be reused many times, thus lowering the expense of purification. 

20 A number of potential clinical uses of bacterial Ig receptors are currently under investigation. They 
include passing plasma over extracorporeal columns of immobilized Ig receptors, then reinfusing the treated 
plasma. See, for example, Tenan, D.S., et al., N. Eng. J. Med.305 :1 195-1 200 (1981). 

The best known bacterial Ig receptor is Protein A of Staphylococcus aureus , which binds to the 
constant F c domain of immunoglobulin IgG. Other bacterial Ig receptors also have been identified. One of 

25 these is known as Protein G of Group G streptococci. Although Protein G is analogous to Protein A, Protein 
G has several important advantages. For example, Protein G binds to all subclasses of human IgG, whereas 
Protein A does not bind to the igG3 subclass [Reis, K. J. et al. J. Immunol. 132 :3098-3102 (1984)]. Protein 
G also is specific for IgG and does not cross-react with human antibodies of type IgA and IgM as Protein A 
does. [Myhre, E.B. and Kronvall, G. "Immunoglobulin Specificities of Defined Types of Streptococcal Ig 

30 Receptors" In : Basic Concepts of Streptococci and Streptococcal Diseases ; J.E. Holm and P. Christensen, 
eds.; Redbook, Ltd., Chertsey, Surrey; pp. 209-~210 (1983)]. In addition, Protein G binds to certain animal 
IgGs to which Protein A binds weakly or not at all. These include bovine, ovine, and caprine IgGl and 
several subclasses of equine IgG (Reis, K. J. et al., supra ). Protein G also has been found superior to 
Protein A in binding to several subclasses of murine monoclonal antibodies [Bjorck, L and Kronvall, G. J. 

35 Immunol. 133 :969-974 (1984)]. For these reasons, Protein G is likely to become the bacterial Ig receptor of 
choice in a~variety of applications. 

Protein G may be obtained for study by purification from Streptococcal strains which naturally produce 
it. For example, Streptococcal cells have been treated with proteolytic enzymes (e.g., papain or trypsin) to 
soiubilize the Protein G (which is a cell wall protein), followed by known protein purification procedures 

40 (e.g., ion exchange chromatography, gel filtration, and affinity chromatography) to further purify the Protein 
G (European Patent Application, Publication Number 0 131 142). 

Thus the present invention in its broadest terms addresses the advantages, uses and potential uses of 
Protein G and variants thereof. It is appreciated that it would be desirable to be able to produce such 
variants using recombinant DNA methodology and to immobilize the variants for use in chromatography. 

45 Accordingly, it is an object of the present invention to construct and clone genes encoding Protein G and 
variants thereof and to produce Protein G and variants thereof by transforming a microbial host with the 
cloned gene and cultivating the host under Protein G-producing conditions. It is also an object of the 
present invention to immobilize Protein G and variants thereof on solid supports and to use the immobilized 
variants in chromatographic separation processes. 

50 Thus a first of the present invention relates to a nucleic acid sequence encoding for Protein G or a 
Protein G variant. 

A second aspect of the present invention relates to a nucleic acid sequence encoding for an active 
binding site or a portion thereof of Protein G or a Protein G variant. 

Thus in its broadest terms, the invention in a third encompasses recombinant Protein G or a variant 
thereof. 
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To these ends, the invention in a fourth aspect covers a vector comprising a nucleic acid sequence 
encoding for Protein G or a variant of Protein G, or an active binding site thereof. The vector is preferably 
lambda gt11, m13mp9, pBR322 or pGx1066. 

A fifth aspect of the invention relates to a host transfected with the vector of the fourth aspect. The host 
5 is preferably E. coli or Bacillus subtilis . 

A sixth aspectfbf the present invention relates to a Protein G variant. 

It will be appreciated that preferred features and characteristics of one aspect of the present invention 
are as for another aspect mutatis mutandis . 

The invention is directed to immobilized Protein G variants on a solid phase support and the use 
w thereof. In particular, the invention also is directed to methods for detecting, isolating and purifying 
immunoglobulins and immunoglobulin fragments using the immobilized Protein G variants of the invention. 

Surprisingly, it has been discovered that the immobilized Protein G variants of the invention exhibit 
binding characteristics with immunoglobulin and immunoglobulin fragments which are unlike those of 
Protein G. Therefore, the immobilized Protein G variants of the invention can be used in new and 
75 unexpected ways to effect the detection, isolation and purification of immunoglobulins and immunoglobulin 
fragments. They may also have a high binding capacity for Ig or fragments thereof. 

The present invention provides cloned genes encoding Protein G variants. By the term "Protein G 
variant" is intended proteins comprising one or more natural IgG binding domains of Protein G, hybrids of 
the natural IgG binding domains, and mutants thereof wherein the variant retains the capability of binding Ig, 
20 or fragments thereof (the term "variant" may be taken to exclude naturally occurring Protein G). Thus a 
"variant" can be thought of as a protein differing from naturally occurring Protein G by the deletion, addition 
or substitution of one or more amino acid residues so as not to substantially alter the sequence or 
properties thereof. 

By the term "hybrid sequence" is intended DNA or amino acid sequences which contain portions of the 
25 respective sequences corresponding to B1 and B2 and which retain immunoglobulin binding properties. 
Such a hybrid sequence is shown in Figure 9 and is labeled B3. This hybrid sequence comprises the first 6 
amino acids of the binding domain B2 fused to the last 49 amino acids of the B1 domain. Thus, it is 
intended that ail such hybrid sequences which retain immunoglobulin binding properties are within the 
scope of this invention. 

30 Methods for obtaining such Protein G variants are described in detail in the Examples, below. One of 
ordinary skill in the art can screen such Protein G variants for the capability of binding Ig or fragments 
thereof with no more than routine screening. For example, the Protein G variant may be contacted with Ig or 
a fragment thereof followed by detection of binding between the Protein G variant and Ig or fragment 
thereof. Methods for the detection of binding between two proteins are well known to those of ordinary skill 

35 in the art. 

Protein G genes may be derived from the Protein G gene of the Streptococcus Sp. , Lancefield Group 
G strain or from strain G148. The invention also provides for Protein G genes inserted into a cloning vector. 
Cells of a prokaryotic organism which have been stably transformed with recombinant vectors are 
disclosed. 

40 The invention further provides the identification of the nucleotide sequence and amino acid sequence 
for the active binding sites of the Protein G molecule. Also provided are eleven genes encoding single 
domain Protein G variants. The Protein G variants expressed therefrom, when immobilized on a solid phase 
support, exhibit altered Ig binding specificities unlike those of Protein G. Protein G variants do not contain 
the lethal sequences which naturally flank the protein G gene and which limit the quantity of protein G 

45 which may be expressed by a recombinant host. 

Cloning of the Protein G gene and production of Protein G and Protein G variants in bacterial hosts, 
such as E. coli or Bacillus subtilis , through recombinant DNA technology, according to the methods of the 
present irwentibn, provides a number of advantages over current methods for obtaining the protein. By the 
methods of this invention, relatively high production levels of microbial Protein G and protein G variants can 

50 be obtained. Moreover, the proteins can be produced under conditions where they can be isolated more 
favorably, and the proteins can be produced in a non-pathogenic host. The cloned genes may be inserted 
into various multicopy expression vectors to give enhanced levels of these valuable IgG-binding proteins in 
cultured E. coli cells transformed with the recombinant expression vectors. Production of Protein G and 
Protein GT variants in E. coli or B. subtilis cells is preferable to cultivation of Protein G-producing 

55 Streptococcal strains, wtiich~are commonly pathogenic strains. 

The first step in the cloning of the Protein G gene, according to the present invention, is isolation of 
Streptococcal strains that produce Protein G. This may be done by assaying various strains for IgG binding 
activity using any suitable immunoassay technique. A technique used by the Applicant is the colony 
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immunoassay described in detail in the examples section below. Strains found to have IgG-binding activity 
are next tested for the ability to bind lgG3 as well as unfractionated IgG, since the ability to bind lgG3 is a 
desired property associated with Protein G. A hemagglutination assay using red blood cells coated either 
with lgG3 or with unfractionated IgG (described in detail in the examples below) is a convenient method for 

5 identifying Protein G-producing strains. A known Protein A-producing strain, such as Staphylococcus aureus 
Cowan I [Sjoquist, J., Eur. J. Biochem. , 78 : 471-490 (1977)], may be used as a control, since Protein A 
binds unfractionated IgG buFnot lgG3. 

Chromosomal DNA is isolated from strains found to produce Protein G by cultivating the strains in a 
nutrient medium to a desired ceil density, then lysing the cells by any of the conventional chemical, 

10 mechanical, and/or enzymatic methods known in the art. Conventional extraction and precipitation proce- 
dures are used to isolate the chromosomal DNA. Fragments of DNA of a suitable size for cloning are 
obtained by such known mechanical methods as sonication or high-speed stirring in a blender, or by 
enzymatic methods, such as partial digestion with DNAsel, which gives random fragments, or with 
restriction endonucleases, which cleave at specific sites. 

15 The chromosomal DNA fragments then are inserted into a cloning vector. Any suitable plasmid or 
bacteriophage cloning vector may be used. For a vector to be suitable, it should have several useful 
properties. It should have an origin of replication that is functional in the intended microbial host cells, and a 
selectable marker (such as an antibiotic resistance gene) to aid in identification of host cells that have been 
transformed with the vector. It should be able to accept inserted DNA fragments and still replicate normally. 

20 Preferably, the vector comprises one or more unique restriction endonuclease recognition sites at which 
DNA fragments can be inserted without destroying the vector's ability to replicate. 

Suitable cloning vectors include phage derivatives such as lambda gtll [Young and Davis, Proc. Nat'l 
Acad. Sci. U.SA , 80 :1 194-1 198 (1983)], the various phage M13-derived vectors such as M13mp9 
(commercially availabie~from Bethesda Research Laboratories), plasmids such as pBR322, and many others 

25 [Old and Primrose, Principles of Gene Manipulation , 2nd. Ed., Univ. of Calif. Press, pgs. 32-35 and 46-47 
(1981)]. The Applicant used a pBR322-derived plasmid vector pGX1066, shown in Figure 1. 

The Streptococcal DNA is inserted into the cloning vector by such methods as homopolymeric tailing or 
by using linker molecules (Old and Primrose, supra at page 92). Advantageously, the vector is linearized 
with a restriction endonuclease, and the chromosomal DNA is also digested with a restriction endonuclease 

30 that produces DNA fragments that are ligatable to the ends of the linearized vector molecule. The 
Streptococcus-derived DNA fragments are thus advantageously inserted into the cloning vector in a 
standard reaction using the enzyme T4 DNA ligase. 

Bacterial cells are transformed with the recombinant cloning vector, using standard procedures, and the 
bacterial colonies are screened for production of Protein G. Assays such as the colony immunoassay and 

35 the hemagglutination assay described in the examples below are suitable for identification of recombinant 
strains producing Protein G. As described more fully in Example I below, the initial positive colony identified 
was unstable. Through purification procedures in which this clone underwent several rounds of restreaking, 
a derivative of the clone was obtained which appeared stable and produced Protein G. This strain .was 
designated E. coli GX7820. 

40 Plasmid~DNA from this strain was isolated and then analyzed by restriction analysis followed by gel 
electrophoresis. It has been determined that the strain contains two plasmids. One, designated pGX1066X, 
appears to be approximately the same size as the pGX1066 cloning vector; the other, designated pGX4530, 
appears to be pGX1066 containing an 11 kilobase-pair (kbp) insert. Although the Applicant does not wish to 
be bound by a particular theory, it appears, as illustrated more fully in the examples, that pGX1066X is a 

45 "cryptic helper plasmid", a derivative of pGX1066 in which the ampicillin resistance gene is no longer 
intact. The original transformant strain probably contained pGX1066 and pGX4530, and was unstable 
because pGX4530 was lost from the cells due to lack of selective pressure to retain that plasmid when 
pGX1066 was present to provide ampicillin resistance. Once pGX1066X appeared, having a mutation that 
inactivated its ampicillin resistance gene, only those host cells which had retained pGX4530 (having an 

so intact ampicillin resistance gene) could survive on the ampicillin plates. Plasmid pGX1066X is retained in 
the cells containing both plasmids, presumably because it serves to limit the copy number of pGX4530 in 
the cell. Plasmid pGX4530 alone is lethal to the host cells (see Example 1) since it has DNA sequences 
which are lethal to the host. However, the presence of pGX1066X in the same host cell reduces the copy 
number of pGX4530 to a tolerable level. The plasmids are from the same "incompatibility group", i.e., the 

55 plasmids compete with each other for maintenance in the cell, so that each plasmid limits the copy number 
of the other in the host cell. E. coli strain GX7820 has been deposited with the American Type Culture 
Collection in Rockville, Maryland, and given accession number 53460. 

An E. coli strain was transformed with a mixture of the plasmids isolated from strain GX7820. The 
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transformation resulted in a number of tiny, strongly positive colonies with a few (about 20%) resembling 
GX7820. From these tiny positive colonies have been isolated two stable variants which do not carry the 
helper plasmid and which are more strongly positive for Protein G than the original GX7820. One strain, 
designated GX7823, carries a plasmid (pGX4533) from which has been deleted a two kilobase pair (kbp) 

5 fragment of the insert in the pGX4530 plasmid. E. coli strain GX7823 has been deposited with the American 
Type Culture Collection in Rockville, Maryland and given accession number 53461. The other, designated 
GX7822, carries a plasmid which has acquired a three kbp insert of DNA within the original insert at a site 
very close to one end of the deletion in the plasmid carried by the GX7823 strain. The Protein G gene has 
been located on a 1 .9 kilobase pair (kbp) fragment of the Streptococcal DNA insert on pGX4533. 

w To improve Protein G production levels, the cloned Protein G gene may be inserted into a variety of 
expression vectors. The expression vectors comprise "regulatory regions", which include DNA sequences 
necessary for gene expression, i.e., the transcription of DNA into mRNA followed by translation of the 
mRNA into the protein that the gene encodes. The Protein G gene may contain its natural expression 
signals, or those signals may be removed and the structural portion of the cloned Protein G gene (i.e., the 

75 protein-encoding portion of the gene) can be operably fused, in accordance with conventional methods, to 
other expression signals, contained in an expression vector, which are capable of directing the Protein G 
gene in the chosen host organism. For example, when the host microorganism is E. coli , the expression 
vector may comprise such known regulatory regions as the trp promoter/operator, the lac 
promoter/operator, the bacteriophage lambda P L promoter/operator, and many others. 

20 In one embodiment of the invention, the expression vector further comprises a DNA sequence 
homologous to a region of the chromosome of the host microorganism. This construction permits linear 
integration of the vector into the host chromosome in the region of homology. An advantage to this method 
is that there is less likelihood of loss of the Protein G sequence from the host, due to negative selection 
favoring vector-free cells. 

25 Protein G may be produced at high levels in bacterial cells transformed with such recombinant 
expression vectors. In addition, production of Protein G within the cell may be controlled by using 
promoter/operator systems which may be induced (to begin gene expression) at a desired cell density, or in 
which gene expression can be reversibly repressed until the cell density in a culture of recombinant 
bacterial cells has reached a desired level. The potentially negative effects on cell growth of production of a 

30 heterologous protein can thus be avoided. 

Transformed cells containing a cloned Protein G or a Protein G variant gene are cultivated under 
protein- producing conditions such that Protein G or a Protein G variant is produced by the cells. Cultivation 
conditions, including large-scale fermentation procedures, are well known in the art. The cells may be 
cultivated under any physiologically-compatible conditions of pH and temperature, in any suitable nutrient 

35 medium containing assimilable sources of carbon, nitrogen and essential minerals that supports cell growth. 
Protein-producing cultivation conditions will vary according to the type of vector used to transform the host 
cells. For example, certain expression vectors comprise regulatory regions which require cell growth at 
certain temperatures, or addition of certain chemicals to the cell growth medium, to initiate the gene 
expression which results in production of Protein G or a Protein G variant. Thus, the term "protein 

40 producing conditions" as used herein is not meant to be limited to any one set of cultivation conditions. 

Advantageously, the cloned gene is transferred to B. subtiiis by methods previously applied to the gene 
encoding Protein A and described in commonly assigned United States Patent No. 4,617,266 (1986), 
incorporated herein by reference in its entirety. In accordance with these methods, Protein G can be 
synthesized in B. subtiiis . 

45 The functionally active portions of Protein G was localized to a repeating structure by examining the 
IgG- binding activity of protein produced by E. coli strains carrying modified forms of the cloned protein G 
gene. In a preferred embodiment, the invention also relates to a cloned gene which encodes one or more of 
the Ig binding portions of Protein G (Protein G variants) and to the protein so produced which has 
immunoglobulin binding properties. The details of the identification and isolation of the gene coding for the 

50 active binding sites of Protein G are set forth in Example III below. The DNA sequences, and the amino acid 
sequences encoded thereby, of two genes encoding, respectively, two and three active sites per Protein G 
molecule are set forth in Figures 8 and 9. With this information, it is now possible to produce Protein G 
variants which contain one or more Protein G binding sites. Synthetic genes may be constructed, utilizing 
known synthetic procedures, which code for from one to twenty or more active sites within a given amino 

55 acid sequence, thereby providing higher binding efficiency, capacity, and specificity to the resulting 
material. A preferred Protein G variant contains 1 to 10 active binding sites; a more preferred material 
contains one active binding site. 

Also within the scope of this invention are Protein G variants having immunoglobulin-binding properties 
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which, further have deletions or substitutions of amino acids or additional amino acids at the amino or 
carboxyl terminus thereof. 

p re f errec i forms of the Protein G variants are encoded by genes from which coding sequences 
upstream and downstream from the active sites (B1 and B2) have been deleted. The details regarding the 
5 deletion of such coding sequences are set forth in the Examples, below. 

In a preferred embodiment, a cloned gene encoding Protein G variant which encodes a protein 
comprising the B1 and B2 domains (Type 1), which may be used in the practice of the invention, has the 
following DNA sequence: 



10 20 30 

ATGGATCCTG CATTACCTAA 6ACTGACACT 
AAAGGCGAAA CAACTACTGA AGCTGTTGAT 
TACGCTAACG ACAACGGTGT TGACG6TGAA 
ACAGTTACTG AAAAACCAGA AGTGATCGAT 
TACAAACTTG TTATTAATGG TAAAACATTG 

5 310 320 330 

GCAGAAACTG CAGAAAAAGC CTTCAAACAA 
TGGACTTATG ATGATGCGAC TAAGACCTTT 
8TCGCTTCAA AACGTAAAGA AGACTAA 



40 50 60 

TACAAATTAA TCCTTAATGG TAAAACATTG 
GCTGCTACTG CAGAAAAAGT CTTCAAACAA 
TGGACTTACG ACGATGCGAC TAAGACCTTT 
GCGTCTGAAT TAACACCAGC CGTGACAACT 
AAAG6C6AAA CAACTACTAA AGCAGTAGAC 

340 350 360 

TACGCTAACG ACAACGGTGT TGATGGTGTT 
ACGGTAACTG AAATGGTTAC AGAGGTTCCG 



or a degenerate variant thereof. 

The Protein G variant Type 1 expressed from the above disclosed sequence has the following amino 
acid sequence: 



25 



30 



5 10 15 20 25 30 

ItlDPALPKTDTYKLILNGKTLKGETTTEAVD 
31 AATAEKVFKQYA1MDNGVDGEWTYDDATKTF 
61 TVTEKPEV I DASELTP A'V TTYKLV INGKTL 
91KGETTTKAVDAETAEKAFKQYAN0NGVDGV 
121 WTYDDATKTFTVTEMVTEVPVASKR K'E D. 

A second cloned gene encoding a Protein G variant which comprises the B1 and B2 binding domains 
(Type 2), which may be used in the practice of the invention, has the following DNA sequence: 



35 iq 20 30 

ATGGATCCGT ACCCATTACC TAAGACTGAC 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT 
CAATACGCTA ACGACAACGG TGTTGACGGT 
TTTACAGTTA CTGAAAAACC AGAAGTGATC 
ACTTACAAAC TTGTTATTAA TGGTAAAACA 

40 

310 320 330 

GACGCAGAAA. CT6CAGAAAA AGCCTTCAAA 
GTTTGGACTT ATGATGATGC GACTAAGACC 
CCTCGAGGTG ATGCACCAAC TGAACCAGAA 
TTAACTCCTG CAACTCCAAT TGCTAAAGAT 
45 GATGCTAAAA AACCAGAAGC TAAGAAAGAT 



40 50 60 

ACTTACAAAT TAATCCTTAA TGGTAAAACA 
GATGCTGCTA CTGCAGAAAA AGTCTTCAAA 
GAATGGACTT ACGACGATGC GACTAAGACC 
GATGCGTCTG AATTAACACC AGCCGTGACA 
TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

340 350 360 

CAATACGCTA ACGACAACGG TGTTGATGGT 
TTTACGGTAA CTGAAATGGT TACAGAGGTT 
AAACCAGAAG CAAGTATCCC TCTTGTTCCG 
BACGCTAAGA AAGACGATAC TAAGAAAGAA 
GACGCTAAGA AAGCTGAAAC TGCCGGCTAA 



or a degenerate variant thereof. 

The Protein G variant Type 2 expressed from the second cloned gene having the above disclosed 
sequence has the following amino acid sequence: 

50 
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A third cloned gene encoding a Protein G variant comprising the B1 and B2 binding domains (Type 3), 
which may be used in the practice of the invention, has the following DNA sequence: 



10. 80 30 40 50 60 

ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 

TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAPA AGTCTTCAAA 

CAATACGCTA AC6ACAAC6G TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 

TTTACAGTTA CTGAAAAACC iAGAAGTGATC GATGCGTCTG AATTAACACC AGCCGTGACA 

ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

310 380 330 340 350 360 

GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG TGTTGATGGT 
GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATGGT TACAGAGGTT 
CCTCGATCGT GCTAA 



or a degenerate variant thereof. 

The Protein Q variant Type 3 expressed from the third cloned gene having the above disclosed 
sequence has the following amino acid sequence: 



5 10 IS 80 85 30 

1MDPYPLPKTOTYKLILKGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEUTYDDATKT 
61FTVTEKPEVIDASELTPAVTTYKLVINGKT 
91LKGETTTKAVDAETAEKAFKQYANDN6VDG 
181 VWTYOOATKT, FTVTE MVT'EV P R 6 C • 



25 A fourth cloned gene encoding a Protein G variant comprising the B1 and B2 domains (Type 4), which 
may be used in the practice of the invention, has the following DNA sequence: 

10 80 30 40 50 60 

GTGAGAGGCA AAAAAGTATG GATCAGTTTG CTGTTTGCTT TAGCGTTAAT CTTTACGATG 
30 GCGTTCGGCA GCACATCCTC TGCCCAGGCG GCAGGGGATC CAATCGAAGA TACCCCAATT 

ATTCGTAATG GTGGTGAATT AACTAATCTT CTGGGGAATT CAGAGACAAC ACTGGCTTTG 
CGTAATGAAG AGAGTGCTAC AGCT6GGTAC CCATTACCTA AGACTGACAC JTACAAATTA 
ATCCTTAATG GTAAAACATT GAAAGGCGAA AC A ACT ACT G AAGCTGTTGA TGCTGCTACT 



35 



3X0 380 330 34© 350 360 

8CAGAAAAAG TCTTCAAACA ATACGCTAAC ©ACAAC6GTG TTGACGGTGA ATGGACTTAC 
GACGATGCGA CTAAGACCTT TACAGTTACT GAAAAACCAG AAGTGATCGA TGCGTCT6AP 
TTAACACCAG CCGTGACPAC TTACAAACTT GTTATTAATG GTAAAACATT GAAAGGCGAA 
ACAACTACTA AAGCAGTAGA CGCAGAAACT GCAGAAAAAG CCTTCAAACA ATACGCTAAC 
GACAACGGTG TTGATGGT6T TTGGACTTAT GATGATGCGA CTAAGACCTT TACGGTAACT 
tin 680 630 640 650 660 

6AAATGGTTA CAGAGGTTCC TCGAGGTGAT GCACCAACTG AACCAGAAAA ^CAGAAGCP 
AGTATCCCTC TTGTTCCGTT AACTCCTGCA ACTCCAATTG CTAAAGATGA CGCTAAGAAA 
GACGATACTA AGAAAGAAGA TGCTAAAAAA CCAGAAGCTA AGAAAGATGA CGCTAAGAAA 
GCTGAAACTG CCQGCTAA 

45 or a degenerate variant thereof. 

The Protein G variant Type 4 expressed from the fourth cloned gene having the above disclosed 
sequence has the following amino acid sequence (without the 30 amino acid secretion sequence): 



40 



50 



55 



5 10 15 80 85 30 

1AGDP1EDTP I IRNGGELTNLLGN6ETTLAL 
31 RNEE6ATAGYPLPKTDTYKL2 LNGKTLKGE 
61TTTEAVDAATAEKVFKGYANDNGVDGEWTY 
91DDATKTFTVTEKPEVIDABELTPAVTTYKL 
181 VINGKTLKGETTTKAVDAETAEKAFKQYAN 
151DNGVDGVWTYDDATKTFTVTEMVTEVPR6D 
181 APTEPEKPEASI PLVPLTPATPIAKDDAKK 
811 DDTKKEDAKKPEAKKDDAKKAETAG, 

A fifth cloned gene encoding a Protein G variant which comprises the single binding domain B2 (Type 
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5), which may be used in the practice of the invention, has the following DNA sequence: 

10 20 30 40 SO GO 

ATGGATCCAG GCGATGCGTC TGAATTAACA CCAGCC6TGA CAACTTACAA ACTTGTTATT 
5 AATGGTAAAA CATTGAAAGG CGAAACAACT ACTAAAGCAG TAGACGCAGA AACTGCAGAA 

PAAGCCTTCA AACAATACGC TAACGACAAC GGTGTTGATG GTGTTTGGAC TTATGATGAT 
GCGACTAAGA CCTTTACGGT AACTGAAATG BTTACAGAGG TTCCTCGAGG TGATGCACCA 
ACTGAACCAG AAAAACCAGA AGCAAGTATC CCTCTTGTTC CGTTAACTCC TGCAACTCCA 

310 330 330 340 350 360 

10 ATTGCTAAAG ATGACGCTAA GAAAGACGAT ACTAAGAAAG AAGATGCTAA AAAACCAGAA 

GCTAAGAAAG ATGACGCTAA 6AAAGCTGAA ACTGCCGGCT AA 

or a degenerate variant thereof. 

The Protein G variant Type 5 expressed from the fifth cloned gene having the above disclosed 
75 sequence has the following amino acid sequence: 

5 10 15 80 25 30 

1 MDPGDA5ELTPAVTT YKLVI NGKTLKGETT 
3XTKAVDAETAEKAFKQYANDNGVDGVWTYDD 
20 eiATKTFTVTEMVTEVPRGDAPTEPEKPEASI 
91 PLVPLTPATP XAKDDAKKDDTKKEDAKKPE 
121 A K K D D A K K A E T A G. 



25 



A sixth cloned gene encoding a Protein G variant comprising the B1, B1/B2 hybrid domain and B2 
domain (Type 6), which may be used in the practice of the invention, has the following DNA sequence: 



ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGTCTTCAAA 

30 CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 

TTTACAGTTA CTGAAAAACC A6AAGTBATC GATGCGTCTG AATTAACACC AGCCGTGACA 
ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 
3 1 0 320 330 340 350 360 

GACGCAGAAA CTGCAGAAAA AGTCTTCAAA CAATACGCTA ACGACAACGG TGTTGACGGT 
GAATGGACTT ACGACGATGC GACTAAGACC TTTACAGTTA CTGAAAAACC AGAAGTGATC 
GATGCGTCTG AATTAACACC AGCCGTGACA ACTTACAAAC TTGTTATTAA TGGTAAAACA 

35 TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGCCTTCAAA 

CAATACGCTA ACGACAACGG TGTTGATGGT GTTTGGACTT ATGATGATGC GACTAAGACC 
610 £20 630 640 650 6GO 

TTTACGGTAA CTGAAATGGT TACAGAGGTT CCTCGAGGTG ATGCACCAAC TGAACCAGAA 
AAACCAGAAG CAAGTATCCC TCTTGTTCCG TTAACTCCTG CAACTCCAAT TGCTAAAGAT 
BACGCTAAGA AAGACGATAC TAAGAAAGAA GATGCTAAAA AACCAGAAGC TAAGAAAGAT 

40 GACGCTAAGA AAGCTGAAAC TGCCQGCTAA 

or a degenerate variant thereof. 

The Protein G variant Type 6 expressed from the sixth cloned gene having the above disclosed 
45 sequence has the following amino acid sequence: 



50 



55 
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A seventh cloned gene encoding a Protein G variant comprising the single B2 binding domain (Type 7), 
which may be used in the practice of the invention, has the following DNA sequence: 
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or a degenerate variant thereof. 

The Protein G variant Type 7 expressed from the seventh cloned gene having the above disclosed 
10 sequence, has the following amino acid sequence: 

5 10 15 20 25 30 

1MDPGDPSELTPQVTTYKLVXNGKTLKGE1 T 
31 TKAVDAETAEKAPKGYANDNGVDGVWTYDD 
15 61ATKTFTVTEMVTEVPVASKRKED, 

An eighth cloned gene encoding a Protein G variant comprising the binding domains B1 and B2 (Type 
8), which may be used in the practice of the invention, has the following DNA sequence: 



20 



25 



10 20 30 40 50 60 

1 ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
61 TTGAAAGGCG AAACAACTAC TGAAGCTGTT GAT6CTGCTA CTGCAGAAAA AGTCTTCAAA 
121 CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
181 TTTACAGTTA CTGAAAAACC A6AAGTGATC GAT6CGTCTG AATTAACACC AGCCGTGACA 
241 ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 



310 320 330 340 350 360 

301 GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG TGTTGATGGT 

361 GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATGGT TACAGAGGTT 

421 CCTCGAGGTG ATGCACCAAC TGAACCAGAA AAACCAGAAG CAAGTATCCC TCTTGTTCCG 

30 481 TTAACTCCTG CAACTCCAAT TGCTAAAGAT GACGCTAAGA AAGACGATAC TAAGAAAGAA 

541 GATGCTAAAA AACCAGAAGC TAAGAAAGAT GACGCTAAGA AAGCTGAAAC TGCCCCTTCA 



35 



40 



601 TGCTAA 

or a degenerate variant thereof. 

The Protein G variant Type 8 expressed from the eighth cloned gene having the above disclosed 
sequence, has the following amino acid sequence: 

5 10 15 20 25 3C 

1 MDPYPLPKTOTYKL ILNGKTLKGETTTEAV 
31DAATAEKVFK0YANDNGVDGEWTYDDATKT 
61 FTVTEKPEV1 DASELTPAVTTYKLVI NGKT 
91 LK6ETTTKAVDAETAEKAFKDYANDNGVDG 
121 VWTYDDATKTFTVTEMVTEVPRGDAPTEPE 
151 KPEAS1 PLVPLTPATPIAKDDAKKODTKKE 
4& J81DAKKPEAKKDDAKKAETAPSC. 

A ninth cloned gene encoding a Protein G variant comprising the B1 and B2 binding domains (Type 9), 
which may be used in then practice of the invention, has the following DNA sequence: 

50 



55 
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10 20 30 40 50 60 

1 ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 

61 TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGTCTTCAAA 

121 CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT AC6ACGATGC GACTAAGACC 

161 TTTACAGTTA CTGAAAAACC AGAAGTGATC GATGCGTCTG AATTAACACC AGCCGTGACA 

5 241 ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

310 320 330 340 350 360 

301 GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG TGTTGATGGT 
361 GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATGGT TACAGAGGTT 
421 CCTCGAGGTG ATGCACCAAC TGAACCAGAA AAACCAGAAG CAAGTATCCC TCTTGTTCCG 
10 481 TTAACCAGCT GCTAA 

or a degenerate variant thereof. 

The Protein G variant Type 9 expressed from the ninth cloned gene having the above disclosed 
sequence, has the following amino acid sequence: 

75 

5 10 15 20 25 * 30 

1 MDPYPLPKTDTYKL 1 LNGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEWTYDDATKT 
on eiFTVTEKPEVIDASELTPAVTTYKLVINGKT 
20 91 LKGETTTKAVOAET AEKAFKGYANDNGVD6 

121 VWTYODATKTFTVT EWVTEVPRGDAPTEPE 
151 KPEASI PLVPLTSC* 



25 



30 



35 



A tenth cloned gene encloding a Protein G variant comprising the single B1 binding domain (Type 10), 
which may be used in the practice of the invention, has the following DNA sequence: 

10 20 30 40 50 60 

ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA ^TCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
TTTACAGTTA CTGAAAAACC AGAAGTGATC GACGTCCCTC GAGGTGATGC ACCAACTGAA 
SSSaaac CAGAAGCAAG TATCCCTCTT GTTCCGTTAA CTCCTGCAAC TCCAATTGCT 

31« 320 330 340 350 360 

AAAGATGACG CTAAGAAAGA CGATACTAAG AAAGAAGATG CTAAAAAACC AGAAGCTAAG 
AAAGATGACG CTAAGAAA6C TGAAACTGCC GGCTAA 



or a degenerate variant thereof. 

The Protein G variant Type 10 expressed from the tenth cloned gene having the above disclosed 
40 sequence, has the following amino acid sequence: 

5 10 IS 20 25 30 

1 MDPYPLPKTDTYKL X LNGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEWTYDDATKT 
45 61 FTVTEKPEVIDVPRGDAPTEPEKPEASIPL 

91 VPLTPATP XOKDDAKKDDTKKEDAKKPEAK 
121 K D D A K K A E T A G. 

An eleventh cloned gene encloding a Protein G variant comprising a Bl'/B2 hybrid binding domain 
50 (Type 11), which may be used in the practice of the invention, has the following DNA sequence: 



55 
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1C 80 30 40 5C 60 

ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGCCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGATGGT GTTTGGACTT ATGATGATGC GACTAAGACC 
TTTACGGTAA CTGAAATGGT TACAGAGGTT CCTCGAGGTG ATGCACCAAC TGAACCAGAA 
AAACCAGAAG CAAGTATCCC TCTTGTTCCG TTAACTCCTG CAACTCCAAT TGCTPAAGAT 

310 320 330 340 350 3£C 

GACGCTAAGA AAGACGATAC TAAGAAAGAA GATGCTAAAA AACCAGAAGC TAAGQQPGQT 
GACGCTAAGA AAGCTGAAAC TGCCG6CTAA 



or a degenerate variant thereof. 

The Protein G variant Type 11 expressed from the eleventh cloned gene having the above disclosed 
sequence, has the following amino acid sequence: 

15 

5 10 15 £0 25 30 

1MDPYPLPKTDTYKLILNGKTLKGETTTEAV 
31 DAATAEKAFKQYANDNGVDGVWTYODATKT 
61 FTVTEMVTEVPRGDAPTEPEKPEPSI PLVP 
91 LTPATP 1 AKDDAKKDDTKKEDAKKPEAKKD 
20 121 DAKKAETAG. 

By the terms "degenerate variant" is intended DNA sequences having substitutions of bases but which 
encode the same protein. 

It will be recognized by one skilled in the art that additional deletions and structural modifications of the 

25 Protein G variant genes can be constructed by analogous methods. It will further be recognized that various 
upstream deletions and various downstream deletions can be recombined in vitro in different combinations 
to produce novel gene structures. Such recombination can be facilitated by the use of unique restriction 
endonuclease sites for Sma I (upstream from the apr promoter), Kpn I (immediately upstream from 
sequences encoding domain B1) and Hind III (downstream from the coding sequences), by assorting Sma I- 

30 Kpn I fragments and Kpn l- Hind Hi fragments. Novel combinations constructed in this way retain sequences 
encoding the B domains? andlherefore retain IgG-binding activity. Additionally, it will be recognized that the 
Sma l-Bam HI fragment of these plasmids which carries the apr promoter and signal-encoding sequences 
cifTbeleplaced by analogous fragments carrying other promoters and signal-encoding sequences that are 
active in promoting synthesis and secretion of foreign proteins in B. subtilis . 

35 Any suitable known method of protein purification may be used to recover and purify the Protein G 
variants from the host cells. The cells may be lysed, if necessary, using known chemical, physical, and/or 
enzymatic means. The Protein G then may be purified from the cell lysate using such standard procedures 
as adsorption to immobilized immunoglobulin, as described by Sjoquist, U.S. Patent No. 3,850,798 (1974), 
ion-exchange or gel chromatography, precipitation (e.g., with ammonium sulfate), dialysis, filtration, or a 

40 combination of these methods. 

By the term "solid phase support" is intended any support capable of immobilizing the Protein G 
variants of the invention, either covalently, or by adsorption. Solid phase supports which may be used for 
immobilizing the Protein G variants of the invention include, but are not limited to, polymers having hydroxy! 
groups, either free or in esterified form, such as agarose, cellulose, including cellulose esters such as 

45 cellulose nitrate, diazocellulose, cellulose acetate, cellulose propionate, and the like, and acryiamide 
polymers and copolymers, such as polyacryiamide and acryiamide, microtitre plates, glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, agar, starch, or the chemically active membrane having a large 
surface area comprising a hydrophobic, microporous, skinless, polyamide membrane which is chemically 
bound to a residue of an activating agent which is capable of immobilizing the Protein G variants of the 

50 invention. See Degen et aL , U.S. Patent No. 4,693,985 (1987). 

The Protein G variants may be immobilized to the solid phase support according to methods known to 
those of ordinary skill in the art for protein immobilization. For example, the Protein G variants may be 
coated or bonded, either covalently or by adsorption, to the solid phase. Methods for immobilizing proteins 
to solid phase supports are taught, for example, in U.S. Patent Nos. 3,652,761, 3,879,262, 3,986,217, and 

55 4,693,985. See also Examples XV and XVII of the present application. Preferably, the Protein G variants are 
immobilized to tresyl activated or cyanogen bromide activated agarose. Where the Protein G variant has 
one or more cysteine residues, a preferred method of immobilizing the Protein G variant is by coupling it to 
a maieimide-activated agarose support. This support may be prepared by treating agarose modified to 
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contain primary amino groups (in particular, AH-Sepharose, Pharmacia Co.) with sulfosuccinimidyi-4-( p - 
maieimidophenyl) butyrate. The resulting activated gel may then be treated with the Protein G variant to 
give the immobilized Protein G variant product. As disclosed in Example XVII, immobilization of cysteine 
containing variants using the maleimide-activated Sepharose derivative results in unexpectedly high levels 
of protein immobilization and unexpectedly high levels of Ig binding, especially with a mouse monoclonal 
antibody. Where the Protein G variant does not comprise a cysteine residue, the preferred method of 
immobilizing the Protein G variant is by coupling it to tresyl-activated agarose. 

The immobilized Protein G variants of the invention may be used to separate or bind- IgG or fragments 
thereof. Surprisingly, the inventors have discovered that certain immobilized Protein G variants exhibit 
unexpected binding profiles. In particular, it has been discovered that immobilized single domain variants 
bind to IgG and Fc fragments much better than to Fab and F(ab') 2 fragments. Therefore, these immobilized 
variants are useful for the separation of IgG and Fc fragments from Fab and F(ab ) 2 fragments. 

The immobilized single domain Protein G variants of the invention may be used in a method for the 
isolation of Fab or F(ab') 2 fragments from IgG and Fc fragments, comprising 

(a) contacting a sample comprising Fc fragments, IgG and at least one of Fab or F(ab ) 2 fragments with 
a single domain Protein G variant immobilized on a solid phase support; and 

(b) washing the solid phase support obtained in step (a) with a buffer of pH 5 to 8 to give the Fab and F- 
(ab') 2 fragments in the eluate and give a solid phase support with Fc fragments and IgG bound thereon. 

Examples of buffers having pH 5 to 8 include, but are not limited to acetate, phosphate, Tris, borate, 
and bicarbonate. 

The Fc fragment and IgG bound to the immobilized Protein G variant may then be recovered by 

(c) washing the solid phase support obtained in step (b) with a buffer of pH 3.5 to 2.4 to give the Fc 
fragment and IgG fragment in the eluate. 

Examples of buffers having pH 3.5 to 2.4 include, but are not limited to acetate, citrate, and glycine. 

In particular, the Type 5 Protein G variant, immobilized onto agarose, has little or no affinity for Fab or F 
(ab') 2 fragments but binds IgG and Fc fragments tightly and reversibiy. IgG and Fc fragments are bound at 
about pH 5 to 8 while Fab and F (ab') 2 fragments are not bound. IgG and Fc fragments can be eluted by 
lowering the pH to about 3.5 to 2.4. 

The inventors have also discovered that the single domain Protein G variant Type 10 immobilized to 
agarose binds IgG irreversibly (the IgG cannot be eluted by lowering the pH of the eluate), binds Fab and 
F(ab') 2 to a small extent at pH 5 to 8, and binds Fc. For example, at pH 5 to 8, Fc is bound but can be 
eluted by lowering the pH to 3.5 to 2.4. 

The Immobilized Type 10 variant, having irreversibly bound and thus immobilized an IgG specific for 
any particular target molecule, may be used in affinity chromatography. In this embodiment of the invention, 
a target molecule bound to the target molecule specific IgG may be selectively eluted without elution of the 
IgG by washing the solid phase with a buffer of pH 3.5 to 2.4. 

This aspect of the invention relates to a method for affinity chromatography of a target molecule, 
comprising 

(a) contacting a first sample comprising a target molecule-specific IgG with a solid phase support having 
Protein G variant Type 10 immobilized thereon to give IgG irreversibly immobilized to the solid phase 
support; 

(b) contacting a second sample comprising a target molecule with the solid support having the target 
molecule-specific IgG immobilized thereon obtained in step (a); 

(c) washing the solid phase support obtained in step (b) with a buffer of pH 5 to 8 to elute unbound 
solute; and 

(d) washing the solid phase support obtained in step (c) with a buffer of pH 3.5 to 2.4 to give the target 
molecule in the eluate. 

The target molecule-specific IgG antibodies may be obtained according to methods known to those of 
ordinary skill in the art. Reference is made to Kohier et aL , Nature 256 :495 (1975) for methods of 
preparing antibodies. 

The inventors have also discovered that the hybrid single domain Protein G Type 11 variant immo- 
bilized to agarose moderately binds IgG at pH 5 to 8, binds Fab to a smaller extent at pH 6 to 8, and binds 
Fc. For example, at pH 5 to 8, the Fc fragment is bound but may be eluted by lowering the pH to 3.5 to 2.4. 

Unexpectedly, the inventors have discovered that the various subclasses of IgG including IgGl, lgG2, 
lgG3, and lgG4 may be fractionated and isolated using the Protein G variant Type 11 immobilized on a 
solid phase support. 

In this embodiment, the invention relates to a method for fractionating the various subclasses of IgG, 
comprising 
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(a) applying a sample comprising at least two subclasses of IgG to the top of a column containing the 
Protein G variant Type 11 immobilized to a solid phase support; 

(b) eluting the column with a buffer of pH 8 to 2.4 over a continuous or discontinuous gradient; and 

(c) collecting fractions of eiuate containing the IgG subclasses. 

5 The various IgG subclasses may be detected in the eiuate by any means known to those of ordinary 
skill in the art. For example, the IgG subclasses may be detected with a commercially available isotyping kit 
(Amersham) or by ELISA or an immunodiffusion assay. 

Examples of buffers which may be used in a gradient of pH 8 to 2.4 include, but are not limited to 
acetate, citrate, phosphate, Tris, glycine and borate. 

10 The inventors have also discovered that cysteine-containing Protein G variants, when immobilized to 
maleimide activated agarose, provide high binding capacity for IcjG and fragments thereof. This aspect of 
the invention relates to a method for the isolation of Fab or F(ab ) 2 fragments from IgG and Fc fragments, 
comprising 

(a) contacting a sample comprising Fc fragments, IgG and at least one of Fab or F(ab ) 2 fragments with 
75 a Protein G variant comprising at least one cysteine residue immobilized to maleimide activated agarose; 

and 

(b) washing the agarose obtained in step (a) with a buffer of pH 5 to 8 to give the Fab and F(ab )z 
fragments in the eiuate and give agarose with Fc fragments and IgG bound thereon. 

The invention also relates generally to methods for separating IgG from the impurities naturally 
20 associated therewith. This aspect of the invention relates to a method for the separation of IgG from 
impurities naturally associated therewith, comprising 

(a) contacting a sample comprising IgG with a Protein G variant immobilized to a solid phase support; 

(b) washing the solid phase support obtained in step (a) with a buffer of pH 5 to 8 to give the impurities 
in the eiuate and IgG bound to the solid phase support; and 

25 (c) washing the solid phase support obtained in step (b) with a buffer of pH 3.5 to 2.4 to give IgG in the 
eiuate which is free from the impurities naturally associated therewith. 

The invention will now be described by way of example with reference to the following Examples and to 
the accompanying drawings, in which: 

Figure 1 is a diagram showing the salient features of plasmid pGX1066, a vector suitable for use in 
30 cloning a Protein G-encoding DNA fragment; 

Figure 2 shows a partial restriction map of plasmid pGX4533, a recombinant plasmid vector containing a 
Protein G-encoding DNA fragment; 

Figures 3 and 3A-C shows a DNA sequence for the Protein G gene, as well as the amino acid sequence 
encoded by the gene; 

35 Figure 4 shows the restriction map of the cloned protein G gene and the repeating structure of its protein 
product responsible for IgG-binding; 

Figure 5 shows the partial restriction map of mGX4547, a bacteriophage vector containing a Protein G- 
encoding fragment; 

Figure 6 shows the partial restriction map of mGX7880, a bacteriophage vector which contains two 
40 complete copies of the B structure and lacks all amino acid sequences distal to B1 and B2; 

Figure 7 shows a restriction map of plasmid pGX4582, a recombinant plasmid vector used to transform 
B. subtilis which contains a Protein G-encoding fragment; 

FTgures 8 and 8A-C show the location of the active sites, B1 and B2, on Protein G as coded for by the 
cloned Protein G gene derived from streptococcus GX7809; 
45 Figures 9 and 9A show the DNA and amino acid sequences for the cloned Protein G gene derived from 
streptococcus GX7805. This gene codes for a Protein G containing three active sites; 
Figure 10 shows the relationship between the repeating structures of the Protein G gene derived from 
strains GX7805 and GX7809; 

Figure 1 1 depicts construction of plasmid pGX4595 from plasmid pGX4582 wherein coding sequences 
50 upstream from the active sites sequences have been deleted; 

Figure 12 depicts construction of plasmids pGX4597 and pGX4599 from plasmid mGX7872 wherein 
coding sequences downstream from the active sites have been deleted; 

Figure 13 depicts the construction of plasmids pGX5203 (which expresses Protein G variant Type 3) and 
pGX5204 (which expresses Protein G variant Type 2) from plasmids pGX4597 and pGX4599 respectively 
55 and the plasmid pGX2606 which contains the promoter OL/PR; 

Figure 14 depicts the construction of plasmid pGX4592 which contains the gene which encodes Protein 
G variant Type 1 ; 

Figure 15 depicts the construction of plasmid pGX5245 which contains the gene which encodes Protein 

13 
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G variant Type 5; 

Figure 16 depicts the construction of plasmids pGX5247 and pGX5246 which contain genes which 
encode Protein G variants Type 6 and Type 1 1 , respectively; 

Figure 17 depicts the construction of plasmid pGX5244 which contains the gene which encodes Protein 
5 G variant Type 7; 

Figure 18 depicts the construction of piasmids pGX5255 and pGX5268 which contain genes which 
encode Protein G variants Type 8 and Type 9, respectively; 

Figure 19 depicts the construction of plasmid pGX5266 which contains the gene which encodes Protein 
G variant type 10. 

w The following Examples are provided to illustrate the invention, and are not to be construed as limiting 
the scope of the invention in any manner. 



EXAMPLE I 

75 



Cloning a Streptococcus Protein G Gene into E. coli 



20 

Streptococci of Lancefield group G were obtained from hospitals, and 11 independent isolate strains 
were derived from the clinical isolates. Each strain was assayed for ability to bind IgG using the following 
colony immunoassay procedure. The strains were streaked on L-Broth-agar plates which had been overlaid 
with a sheet of nitrocellulose and (top layer) a sheet of cellulose acetate ("immunoassay plates"). The 

25 plates were incubated at 37° C until bacterial colonies were visible on the cellulose acetate sheet. 

The nitrocellulose sheets then were removed from the plates, and IgG-binding proteins were detected 
on the sheets using an immunochemical procedure, as follows. The sheets were first treated with bovine 
serum albumin (3.0% w/v in "Tris-saline", which comprises 0.01 M Tris-HCl, pH 8.0, and 0.1 5M NaCI) to 
block nitrocellulose sites to minimize non-specific binding of antibodies to the nitrocellulose in subsequent 

30 steps. The sheets then were treated with normal rabbit serum (diluted 1:1000 in Tris-saline containing 3% 
w/v bovine serum albumin) for 1 hour at 23 °C, followed by peroxidase-conjugated goat anti-rabbit IgG 
(similarly diluted), and, finally, with 4-chloro-1-naphthol (0.6 mg/ml) and hydrogen peroxide (0.06% w/v in 
Tris-saline containing 0.2 volume methanol), washing the sheets with Tris-saline between incubation steps. 
Blue spots on the nitrocellulose sheet indicate the presence of IgG-binding protein, and the blue areas 

35 correspond to microbial colonies which produced the IgG-binding protein. 

Nine of the strains were positive, i.e. were found to bind IgG, although to varying degrees. Several of 
the strains were next tested for ability to bind lgG3, using the following hemagglutination assay. Sheep red 
blood cells (RBC) (Cappel Laboratories, Malvern, Pennsylvania) were coated with immunoglobulin essen- 
tially as described by Adler and Adler [ Meth. Enzymol. 70 :455-466 (1980)]. RBC were washed with 

40 phosphate-buffered saline (PBS, containing 8.4 g/I NaCI, 1.1 g/I Na 2 HPO;, and 0.27 g/l NaH 2 POO and 
treated for 15 min. at 37 °C with a solution of tannic acid at 2.5 mg/ml in PBS. Cells were recovered by 
centrifugation and resuspended in PBS containing, at 0.2 mg/ml, either (a) total human immunoglobulin G 
(available from Sigma Chemical Co; St. Louis, Mo.), (b) lgG3 myeloma protein or (c) PBS only. After 
incubation at 37 °C for 30 min, RBC were recovered by centrifugation and washed with PBS. For the 

45 agglutination assay, 50 ul of a 1 % suspension of coated RBC were mixed with 50 ul of a test cell extract, 
diluted serially in PBS, in a conical well of a multiwell dish. Unagglutinated RBC settle to the bottom of the 
well and form a small pellet, while agglutinated RBC form a more diffuse precipitate on the walls of the well. 

Each of the positive group G Streptococcal strains agglutinated lgG3-coated erythrocytes as efficiently 
as erythrocytes coated with unfractionated IgG, which is expected for Protein G-producing strains. In 

so contrast, Staphylococcus aureus Cowan I ceils, a strain which produces Protein A, agglutinated red blood 
cells coated with unfractionated IgG, but showed no activity toward lgG3-coated cells, as expected. None of 
the cells agglutinated red blood cells which had been incubated with PBS only, i.e., uncoated red blood 
cells. 

The same hemagglutination assay then was performed on supernatant fractions and cell extracts from 
55 cultures of the Streptococcus isolates and the isolates appeared to have differing localization of the IgG- 
binding activity. In some strains the activity appeared to be predominantly cell-bound, in some it was found 
predominantly in the culture supernatant, and some strains were intermediate. Three strains, which had 
. differing localization of the IgG-binding activity, were chosen as sources of DNA for cloning the Protein G 
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gene. 

Cells from each strain were cultivated in 250 mis. of Todd-Hewitt broth (commercially available from 
Fisher Scientific, Richmond, Va.) containing 20mM D,L-threonine. After 4 hours of cultivation, glycine was 
added to a final concentration in the culture medium of 5% (w/v). The cells were harvested by centrifugation 

5 after 5 hours of cultivation, when the cell density had reached an absorbance at 600nm of about 0.5 to 1.0. 
The cell pellets were washed with PBS and then frozen in liquid nitrogen and stored at -70° C. After 
thawing, the ceils were washed with, and then resuspended in, 10 mis of S7 medium [described by 
Vasantha and Freese, J. Bacteriology 144 :1 11 9-1 125 (1980)] containing 0.5M sucrose, to which 200 ul of 5 
mg/ml mutanoiysin (commercially available from Sigma Chemical Co.) had been added. Following incuba- 

w tion at 37° C for 45 minutes, the resulting protoplasts were pelleted by centrifugation, and then lysed 
osmotically by resuspension in a solution containing 100mM EDTA, pH 8.0, 150mM NaCI, and 0.5 mg/ml 
Proteinase K. Following incubation at 37 °C for 55 minutes, aipha-toluenesulfonyl fluoride (also called 
phenylmethanesulfonyl fluoride or PMSF, and available commercially, e.g. from Sigma) was added to a final 
concentration of 2mM, and the mixture was incubated at 70° C for 15 minutes to inactivate the Proteinase K. 

75 The cell lysate was extracted three times with chloroform/isoamyl alcohol (24:1) to further remove proteins, 
and an equal volume of isopro panol was added to the aqueous phase to precipitate the DNA. The 
precipitated DNA was collected by winding on a spool, and then was washed with 70% ethanol and dried in 
vacuo . The DNA pellets (from each of the 3 strains) were each resuspended in 0.5 ml of a 0.01 M Tris-HC! 
(pH~7.8)/1mM EDTA/0.05 M NaCI solution. A portion of this isolated chromosomal DNA was partially 

20 digested with the restriction endonuclease Mbo I (commercially available) by adding 2 units of Mbo I to 25 
ul of the resuspended DNA in 100 ul of a buffer containing 100 mM Tris-HCI, pH 7.8, J 50 mM NaCI and 
10mM MgCI 2 - The reaction mixture was incubated at 37° C for 13 minutes, then at 70* C for 10 minutes. 
The digested DNA was subjected to electrophoresis on a 0.8% agarose gel for 15 hours at 0.35 volts/cm. 
The section of the gel containing DNA fragments between about 4 and 9 kilobase-pairs (kbp) in length was 

25 excised from the gel and crushed to aid in recovery of the DNA. An equal volume of H 2 0-saturated phenol 
was added to the crushed gel portion, and the mixture was frozen at -70 °C for 1 hour. Without prior 
thawing, the mixture was centrifuged at room temperature for 15 minutes in an Eppendorf microfuge, and 
the aqueous phase was extracted twice with an equal volume of phenol and once with an equal volume of 
phenoi/chloroform/isoamyl alcohol (25:24:1). The DNA was precipitated from the aqueous phase by adding 

30 2.5 volumes of 95% ethanol and 30 ug glycogen as a carrier. 

The cloning vector into which the chromosomal DNA fragments were inserted was plasmid pGX1066, 
shown in Figure 1. This plasmid comprises a bank of closely-spaced restriction endonuclease recognition 
sites useful for insertion of DNA fragments to be cloned. The bank of cloning sites is bordered by two 
transcription terminators. E. coli strain GX1186, which constitutes strain GX1170 transformed by plasmid 

35 pGX1066 has been deposited with the ATCC as No. 39955, 3 ug of plasmid pGX1066 DNA were digested 
with the restriction endonuclease Bam HI (commercially available and used according to manufacturer's 
specifications.) The digested plasmid DNA then was treated with 1 unit calf intestine alkaline phosphatase, 
(obtained from Boehringer-Mannheim and used according to manufacturer's specifications) for 30 minutes 
at 37° C. Following extraction of the reaction mixture with phenoi/chloroform/isoamyl alcohol (25:24:1), the 

40 DNA was precipitated by adding 0.1 volume 2M sodium acetate, 10mM EDTA, and 2.5 volumes 95% 
ethanol and 10 ug glycogen as a carrier. 0.5 ug of the pGX1066 vector DNA ( Bam Hl-digested and 
phosphatase-treated) then was ligated to 0.2 ug of the partially Mbo I digested Streptococcus chromosomal 
DNA prepared above. The 10 ul reaction mixture contained 1 unit of T4 DNA ligase (commercially available 
and used according to manufacturer's instructions) and was incubated at 4° C for 20 hours. 

45 E. coli SK2267 (F~ gal thi T1 r hsdR4 endA sbcB15 , available from the E. coli Genetic Stock Center, 
YaieTlniversity, New HavenTConn.) cells were made competent for transformation by standard calcium 
chloride treatment, and 0.25 ml of the competent cells then were mixed with 20 ul of the ligation mixture in 
a standard transformation procedure [Lederberg and Cohen, J. Bacterid. 119 :1 072-1 074 (1974)]. The cells 
then were pelleted by centrifugation and resuspended in 0.3 ml L Broth. 0.1 ml of cells then were plated on 

so each of three L Broth-agar plates containing 100 ug/ml ampicillin, which had been overlaid with a sheet of 
nitrocellulose and (top layer) a sheet of cellulose acetate (immunoassay plates). The plates were incubated 
at 37° C until bacterial colonies were visible on the cellulose acetate sheet. 

The nitrocellulose sheets then were removed from the plates, and IgG-binding proteins were detected 
on the sheets using the immunochemical procedure described above. The sheets were first treated with 

55 bovine serum albumin (3.0% w/v in tris-saline) to block nitrocellulose sites to minimize non-specific binding 
of antibodies to the nitrocellulose in subsequent steps. The sheets then were treated with normal rabbit 
serum, diluted 1:1000 in Tris-saline containing 3.0% w/v bovine serum albumin, for 1 hour at 23* C, followed 
by peroxidase-conjugated goat anti-rabbit IgG (diluted similarly), and, finally, with 4-chloro-1-naphthol (0.6 
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mg/ml) and hydrogen peroxide (0.06% w/v in Tris-saline containing 0.2 vol. methanol), washing with tris- 
saline between incubation steps. 

One positive colony was identified, and was located on a plate containing transformants derived from 
Streptococcus strain GX7809 (one of the three Streptococcus strains from which DNA was isolated for 
5 cloning.) The positive colony was streaked out on an immunoassay plate (containing 100 ug/ml ampicillin, 
as above) to obtain a purified transformant strain. The nitrocellulose sheet was processed as above, and 
only a few positives were found among hundreds of negative colonies. It appeared that the original 
transformant was unstable, so the restreaking process was repeated, and only one positive was found 
among hundreds of negative colonies. Another round of restreaking produced a plate containing mostly 
w positive colonies. One of the positive colonies, a derivative which was apparently more stable than the 
original positive transformant, was isolated and designated E. coli strain GX7820. Samples of E. coii 
GX7820 have been deposited at the American Type Culture Collection in Rockville, Maryland and given the 
accession number ATCC No. 53460. 

In addition to the original positive colony, several small but strongly positive spots, which could not be 
75 correlated with any colony, were observed. These spots yielded no positive progeny on restreaking. 

A standard procedure was used to isolate plasmid DNA from E. coli GX7820, and the plasmid DNA was 
analyzed by restriction analysis followed by gel electrophoresis. fhe~strain was found to contain two types 
of plasmids. One plasmid (designated pGX1066X) appeared to be the same size as the pGX1066 cloning 
vector, while the other (designated pGX4530) apparently was pGX1066 containing an 11 kbp insert. 
20 Competent E. coli SK2267 cells then were retransformed with the mixture of plasmids isolated from 
GX7820, ancHranTformants were selected on immunoassay plates containing 100 ug/ml ampicillin. Positive 
transformants of two types were obtained. A majority formed tiny, strongly positive colonies, most of which 
could not be propagated. A minority resembled GX7820 in being of more normal size and more easily 
propagatable. In order to clarify the cause of these results, competent E. coli SK2267 cells were also 
25 transformed with gel purified plasmids, as follows: 
Transformation A: pGX4530 alone 
Transformation B: pGX1 066X alone 
Transformation C: mixture of pGX4530 and pGX1066X 
The results were as follows: 
30 Transformation A: tiny, strongly positive colonies, most of which could not be propagated. 
Transformation B: no transformants 

Transformation C: many tiny, strongly positive non-propagatable colonies (as in trans. A) with about 20% 
of the positives resembling GX7820, i.e. of normal size and propagatable. 

Several of the tiny, strongly positive colonies were chosen from the retransformation plates above (i.e., 

35 the transformants resulting from transformation of E. coH with an unfractionated plasmid preparation derived 
from strain GX7820 comprising a mixture of pGX1066X and pGX4530) and were restreaked to isolate 
propagatable strains. Plasmid DNA was isolated from two strains, and both were found to have lost the 
pGX1066X helper plasmid. One strain (designated E. coli GX7823) contained a plasmid pGX4533 in which a 
deletion of about 2 kbp had occurred in the 1 1 kbp insert found in pGX4530. Samples of E. coli GX7823 

40 have been deposited at the American Type Culture Collection in Rockville, Maryland and given the 
accession number ATCC No. 53461. The second strain (designated E. coli GX7822) contained a plasmid 
pGX4532 which had acquired an additional 3 kbp of unidentified DNA inserted within the original 11 kbp 
insert, at a site very close to one end of the deletion in pGX4533. 

The strains E. coii GX7823 (containing pGX4533) and E. coli GX7820 (containing pGX4530) were 

45 cultivated in L-BrottilJus ampicillin. The cells were pelleted by centrifugation, lysed by incubating for 30 
min at 37* C in the presence of 0.5 mg/ml lysozyme in a buffer containing 50 mM EDTA, pH 8.0, and 2mM 
PMSF. Samples of the extracts were prepared for electrophoresis by heating for 5 min. at 100° C in the 
buffer described by Studier [ J. MoT. BioL 79 :237-248 [1973)], and the samples were subjected to 
electrophoresis on a 12.5% acrylimld^SDS gel as described by Studier, op cit , to separate the proteins. 

so A standard electrophoretic (Western Blotting) technique was used to transfer the protein bands from the gel 
to nitrocellulose paper. The nitrocellulose was subsequently incubated (in sequence) with BSA, normal 
rabbit serum, peroxidase-conjugated goat anti-rabbit IgG, and 4-chloro-1-naphthol plus H 2 0 2 (the same 
nitro-cellulose treatment as the immunochemical procedure described above). Both strains were found to 
produce the same IgG-binding protein bands with mobilities corresponding to molecular weights between 

55 approximately 90,000 to approximately 30,000 with a predominant band at 57,000. 

Plasmid pGX4533 was subjected to restriction analysis, and a partial restriction map is shown in Figure 
2. The single line represents the vector (pGX1066) sequences, while the hatched area represents the DNA 
that has been inserted into the plasmid vector and which contains the Protein G gene. 
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The 1 9 kbp Hin dill fragment in the insert was subcloned into pGX1066, and the resulting recombinant 
plasmid (pGX4547)~ was transformed into E. coli . Western blotting of the proteins produced by this 
transformant ( E. coli GX7841) was done as described above, and the same IgG-binding protein bands were 
present including Thi predominant 57,000 band. The transformant was also analyzed in a hemagglutination 
assay, as described above. Extracts of the transformant agglutinated tanned sheep erythrocytes coated with 
IgG3 (human myeloma protein) and with unfractionated human IgQ, but uncoated erythrocytes were not 
agglutinated. An extract from a Protein A-producing E. coli strain agglutinated the erythrocytes coated with 
unfractionated IgQ, but not those coated with lgG3 or uncoated erythrocytes. A control E. coli strain which 
produced neither Protein A nor Protein G failed to agglutinate any of the erythrocyte samples. 

These results demonstrate that E. coli strains GX7841, GX7820, and GX7823 produce IgG-binding 
protein having the properties which are characteristics of Protein G. 



EXAMPLE II 

75 



DNA and Amino Acid Sequence Data 



20 



25 



The DNA sequence of the cloned gene was determined. This sequence is shown in Figure 3, along with 
the amino acid sequence specified by the DNA sequence. The data in Figure 3 are for the entire l.9kbp Hin 
dlli fragment which contains the cloned Protein G gene, as described above. 

It will be appreciated that because of the degeneracy of the genetic code, the nucleotide sequence of 
the gene can vary substantially. For example, portions or all of the gene could be chemically synthesized to 
yield DNA having a different nucleotide sequence than that shown in Figure 3, yet the amino acid sequence 
would be preserved, provided that the proper codon-amino acid assignments were observed. Having 
established the nucleotide sequence of the Protein G gene and the amino acid sequence of the protein, the 
gene of the present invention is not limited to a particular nucleotide sequence, but includes all variations 
30 thereof as permitted by the genetic code. 

The Protein G protein of the present invention is not limited to a protein having the exact amino acid 
sequence shown in Figure 3. A protein comprising deletions or substitutions in the sequence shown in 
Figure 3 or additional amino acids at the amino or carboxyl terminus of the protein, are included in the 
present invention as long as the protein retains the desired IgG-binding properties of Protein G, described 
35 above. These variations in amino acid sequence may be achieved by chemical synthesis of the gene, or by 
known in vitro mutagenesis procedures, for example. 

Thefollowing abbreviations are used in Figure 3: 
A = deoxyadenyl 
T = thymidyl 
40 G = deoxyguanyl 
C = deoxycytosyl 
GLY = glycine 
ALA = alanine 
VAL = valine 
45 LEU = leucine 
ILE = isoleucine 
SER = serine 
THR = threonine 
PHE = phenylalanine 
so CYS = cysteine 
MET = methionine 
ASP = aspartic acid 
GLU = glutamic acid 
LYS = lysine 
55 ARG = arginine 
HIS = histidine 
PRO = proline 
TYR = tyrosine 
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TRP = tryptophan 
GLN = glutamine 
ASN = asparagine 

5 

EXAMPLE III 



io Identification of the Portions of the Protein G Molecule Responsible for the IgG-binding Activity 



By examining the IgG binding activity of protein produced by E. coli strains carrying deleted and 
modified forms of the cloned protein G gene, the activity was localized" tolhe repeating structure between 
15 amino acid residues 228 and 352 (Figure 8). The amino acid sequences of regions B1 and B2 are identical 
at 49 of the 55 corresponding positions in each. This repeating structure is illustrated in figure 4, where it is 
indicated as B1 and B2. 

The 1.9 kbp Hin dill fragment indicated in figure 2, which contains the entire coding sequence for 
protein G, originally" isolated from Streptococcus GX7809, was subcloned in bacteriophage M13mp9 

20 [Messing, J., Methods Enzymol. 101 :20 (1983)]. The plasmid pGX4547 was digested with endonuclease 
Hin dill, as was the double stranded replicative form of bacteriophage M13mp9 DNA. The latter was also 
treated with calf alkaline phosphatase (2 units in 15 ul), which was present during the digestion with Hin dill, 
to prevent recircularization of the vector. After extraction with phenol and precipitation with ethanol, thi two 
digested DNA preparations were mixed and incubated with DNA ligase under ligation conditions. The 

25 ligated DNA preparation was used to transfect E. coli strain GX1210 (F' traD36 proA + B+ laclq/delta- 
IacZM15 delta-(lac-pro) supE thi zig::Tn10 hsdR2)Tfrarisfected cells from plaques were screened for the 
production of protein G by colony immunoassay. One which produced a positive assay response was 
designated mGX4547, and was shown to have the partial restriction map illustrated in figure 5. 

Double stranded replicative form DNA isolated from E. coli infected with mGX4547 was digested with 

30 endonuclease Pst I. After extraction with phenol and ethanol "precipitation, the digested DNA was incubated 
with DNA ligase~under ligation conditions in dilute solution (approximately 5 ug digested DNA per ml). The 
religated DNA preparation was then used to transfect E. coN GX1210. Replicative form DNA was prepared 
from cells infected from several plaques, and the same iinfected cells were assayed for the production of 
IgG-binding protein by colony immunoassay. Several clones were found by analysis of RF DNA with 

35 restriction endonuclease Pst I to have lost both the 210 bp and the 415 bp Pst I fragments indicated in 
figures 5 and 6. These clones produced no active IgG-binding protein, as indicated by colony immunoas- 
say. The truncated protein produced by these clones would be expected to contain only a portion of the 
structure B1, and lack all amino acid sequences distal to B1. 

One of the clones obtained from the above transfection produced a positive result by colony 

40 immunoassay. Restriction analysis of RF DNA from this clone revealed that the phage DNA lacked the 415 
bp Pst I fragment, but retained the 210 bp fragment. Furthermore, the relative intensity of the 210 bp Pst I 
frag~ment band on an ethidium bromide-stained agarose gel electrophoretogram suggested that the DNA 
carried two copies of the 210 bp fragment DNA sequencing confirmed that the structure of this phage DNA 
(mGX7880) was as illustrated in figure 6. The protein encoded by the protein G gene carried on this phage 

45 DNA would be expected to contain two complete copies of the B structure, an intact B1 sequence followed 
by a chimera of B1 and B2. It would lack all amino acid sequences distal to B2. This structure results from 
the fact that the Pst I sites which define the 210 bp fragment are located in the B repeating structures at 
positions corresponding to homologous sequences, and in the same relation to the reading frame of the 
protein. Polyacrylamide gel electrophoretic analysis revealed that E. coli bearing this DNA produced a 

50 protein with IgG-binding activity of approximately the expected sizeTFahnestock, et ah , J. Bacteriol. 167 
:870-880 (1986)]. ~ ~ 

These results indicate that the presence of the B repeated structure is a necessary and sufficient 
condition for IgG-binding activity of protein G. It was therefore concluded that the B repeating structure was 
the locus of IgG-binding activity in the molecule. 

55 

EXAMPLE IV 
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Expression of the Protein G Gene in Bacillus Subtilis 



A synthetic oligonucleotide with a sequence resembling a transcription terminator was first inserted into 
mGX4547. The sequence of the oligonucleotide was: 
5' -pTCGAAAAAAGAGACCGGATATCCGGTCTCTTTTT-3 

It is self-complementary, and when double-stranded, produces single stranded ends with the same 
sequence as those produced by endonuclease Sal I. To insert it into mGX4547, the phage DNA was 
digested with endonuclease Sal I, phenol extracted and ethanol precipitated, then incubated with the 
synthetic oligonucleotide (which had been denatured by heating to 70 °C and slowly cooled to 23 C) and 
DNA ligase under ligating conditions. Ligated DNA was used to transfect E. coli GX1210, and clones were 
screened for the loss of the Sal I site and appearance of an Eco RV site, the recognition sequence for 
which is present on the synthetic oligonucleotide. One clone with the desired structure was designated 
mGX7872. 

Next, sequences distal to the B2 repeated sequence were deleted from mGX7872. This was accom- 
plished by oligonucleo tide-directed in vitro mutagenesis. The following oligonucleotide was synthesized: 
5 , -pCGTTTTGAAGCGACCGGAACCTCTGTAACC-3 / . 

This sequence is complementary on one half to sequences in mGX4547 immediately distal to the B2 
sequence, and on the other half to sequences near those coding for the C-terminus of protein G. This 
oligonucleotide was used as a primer for the in vitro synthesis of double stranded RF DNA, with mGX4547 
DNA as template, using standard methods. This DNA was used to transfect E coli GX1210. Plaques were 
screened in situ for the ability of phage DNA they produced to hybridize to the radioactive oligonucleotide 
5 / -(32P)AGCGACCGGAACCTC-3' J which is complementary to the desired deleted sequence. One clone 
with the desired structure was identified and designated mGX7877. Its structure was verified by DNA 
sequence analysis. The deletion encompasses nucleotides 1651-1896 of the sequence shown in figure 3. 

In order to allow fusion of the protein G coding sequence to an expression and secretion vector, a Bam 
HI site was created in the sequence of mGX7877 by oligonucleotide-directed in vitro mutagenesis. A primer 
oligonucleotide, with the sequence, 

5' -pGGTATCTTCGATTGGATCCGGTGAATCAACAGCGAATACCG-3 , 

was used to promote conversion of mGX7877 single stranded DNA to duplex DNA in vitro . This 
oligonucleotide was complementary to the sequence encoding protein G in mGX7877, but includes an 
additional 6 nucleotides, GGATCC, which comprise the recognition sequence for endonuclease Bam HI, 
inserted near the beginning of the sequence encoding mature protein G (the product of removal of the 
secretion signal sequence). The resulting double stranded DNA was used to transfect E. coli GX1210. The 
RF DNA recovered from cells infected from plaques was screened for the presence of the Bam HI site. One 
with the desired structure was designated mGX8402. 

A secretion vector containing the promoter and secretion signal sequence derived from a Bacillus 
amyloliquefaciens gene encoding subtiiisin (apr) has been described by Vasantha and Thompson [ J. 
Bacterid. 165 :837 -842 (1984); and U.S. Patent Application, Serial No. 618,902, filed June 8, 1984, and the 
continuati o~nTn-part thereof, Serial No. 717,800, filed March 29, 1985]. This vector, pGX2134, contains a 
Bam HI site near the end of sequences encoding the secretion signal sequence, to which heterologous 
^ines can be fused in order to promote their expression in B. subtilis , and the secretion of the protein 
product from the ceil. In order to fuse protein G-encoding sequences to this vector, pGX2134 DNA was 
digested with endonucleases Bam HI and Pyu II. The RF DNA from mGX8402 was digested with 
endonucleases Bam HI and Sma I. After extraction with phenol and precipitation with ethanol, the digested 
DNA preparationswere mixed and incubated with DNA ligase under ligation conditions, and the ligated DNA 
was used to transform B. subtilis GX8008 (apr deletion, npr deletion, spoOA677) protoplasts by standard 
methods. Transformants"^werelelected for resistance to chloramphenicol and screened for production of 
protein G by colony immunoassay. A positive transformant was identified and designated GX8408 
(pGX4582). The plasmid pGX4582 was shown to have the structure indicated in figure 7 by restriction 
analysis. It was presumably formed by insertion into pGX2134, between the Bam HI and Pyu II sites, of the 
Bam HI fragment of mGX8402 bearing the protein G coding sequences, plus the small Bam Hl-Sma I 
fragment which is distal to the coding sequences in mGX8402. 

Strain GX8408 was shown to produce a protein with the IgG-binding activity of protein G. This strain, 
where sequences distal to the B2 sequences are deleted, exhibited enhanced secretion of protein G-like 
material. After growth in appropriate media [Fahnestock and Fisher, J. Bacterid. 165 :796-804 (1984)], 
culture supernatants and cell-associated fractions were recovered and subjected to sodium dodecyl sulfate- 
polyacrylamide gel electrophoretic analysis. After electrophoretic separation, protein bands were transferred 
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to nitrocellulose and stained immunochemical^ as described in Example I. Materia! with IgG-binding activity 
was found in both the culture supernatant and cell-associated fractions. 



EXAMPLE V 



Cloning the Gene Encoding Protein G From Streptococcus GX7805 

w 

Chromosomal DNA was isolated from a group G Streptococcus clinical isolate designated GX7805 as 
described in Example I. A sample of the DNA was digested with restriction endonuclease Hin dill and 
subjected to electrophoresis in a 1% agarose gel under standard conditions. The usefulness of Hin dill for 

15 this purpose was indicated by restriction analysis of pGX4533 (Example I and Figure 2) whereby it was 
determined that a Hin dill site separated the protein G gene from adjacent downstream sequences which 
were shown to be responsible for preventing establishment of larger fragments on multicopy plasmids in E. 
coli . After electrophoresis, DNA fragments were transferred to nitrocellulose as described by Southern J. 
Moi. Biol. 98 :503 (1975). A band of approximately 2.4 kbp containing protein G-encoding sequences was 

20 iocated~by1iybridization with a radioactive probe consisting of the 1.9 kbp Hin dill fragment indicated in 
Figure 2, originally isolated from Streptococcus strain GX7809. The 1.9 kbp fragment probe was purified by 
agarose gel electrophoresis and eluted from the gel as described in Example I, then radioactively labeled 
with 32P by nick translation essentially as described by Rigby, et aK J. MoL Biol. 113 :237 (1977). 
Hybridization was carried out essentially as described by Wahl et aL [ Proc. Natl. Acad. Sci. USA 76 :3683- 

25 3687 (1979)]. After hybridization and washing to remove unhybridized probe, a radioactive band was located 
by autoradiography at a position corresponding to a length of 2.4 kbp. 

A larger sample of the same GX7805 chromosomal DNA (6 ul) was digested with endonuclease Hin dill, 
and the fragments were separated by electrophoresis in a 1% agarose gel (16 h at 0.35 volts/cm). After 
staining with ethidium bromide, portions of the gel containing bands of length 2-3 kbp (located relative to a 

30 standard consisting of endonuclease Hin dill-digested bacteriophage lambda DNA) were excised and 
crushed to aid in the recovery of the~DNA. The DNA was recovered after extraction with phenol as 
described in Example I. 

Piasmid vector pGX1066 DNA (1 ug) was digested with endonuclease Hin dill. Following extraction of 
the reaction mixture with phenol/chloroform/isoamyi alcohol (25:24:1), the DNA was precipitated by adding 

35 O.t volume 4M LiCi, 10mM EDTA, 20 ug glycogen carrier, and 2.5 vol. 95% ethanol. 0.4 ug of the digested 
vector DNA was incubated with the recovered Hin dill fragments of GX7805 DNA (90% of the material 
recovered from 6 ug chromosomal DNA) and ~T4 DNA ligase (International Biotechnologies, Inc., New 
Haven, CT), under ligation conditions as recommended by the manufacturer, in 20 ul, for 16 h at 15 C. . 
E. coli SK2267 cells were transformed with 15 ul of the ligated DNA as described in Example I, and the 

40 transformed cells were plated on colony immunoassay plates and assayed for the production of 
immunoglobulin-binding protein as described in Example I. A positive colony was identified. Piasmid DNA 
isolated from this transformant was found to consist of pGX1066 with a DNA insert of 2.4 kbp. An 
endonuclease Hin dill fragment comprising the insert was subcloned in a bacteriophage M13mp9 vector. 
The DNA sequince of the 2.4 kbp Hin dill fragment was determined, and is presented in Figure 9. 

45 

EXAMPLE VI 



50 

Construction of Preferred Forms of the Protein G Gene for Secretion by B. subtilis 



Preferred forms of Protein G are encoded by genes from which coding sequence upstream and 
55 downstream from the active B repeats (Example 111) have been deleted. Such proteins, which exhibit the 
immunoglobulin binding activity of protein G, have enhanced stability toward proteolysis. 



20 



EP 0 408 178 A1 



A. Deletion of Upstream Sequences 

In order to accomplish the deletion of upstream sequences, plasmid pGX4582 (Example IV) was first 
modified to contain a unique cleavage site for restriction endonuclease Kpn I (see Figure 11). DNA of 
pGX4582 was digested with endonuclease Pvu II, which cuts at two sites, both upstream from the A 
repeats. After phenol extraction and ethanol precipitation, the linear plasmid fragment was then recir- 
cularized in the presence of a s'-phosphorylated self-complementary oligonucleotide with the sequence 5 - 
P-CTGGGTACCCAG, which carries a recognition site for endonuclease Kpn I, by treatment with T4 DNA 
ligase under ligation conditions. The ligated DNA was used to transform E. coli SK2267, and an ampicillin 
resistant transformant was isolated which contained a plasmid of the desired structure, which had acquired 
a unique Kpn I site. This plasmid was designated pGX4590. 

Next, "the modified Protein G gene carried on pGX4590 was transferred to a bacteriophage M13 vector. 
DNA of pGX4590 was digested with endonucleases Sma I and Sal I, which excise a fragment containing the 
entire Protein G encoding gene. Double stranded RF DNA of bacteriophage M13mp8 was also digested 
with Sma I and Sal I, and both digested DNA preparations were extracted with phenol and precipitated with 
ethanoTThe two~preparations were then mixed and incubated with T4 DNA ligase under ligation conditions. 
The ligated DNA was used to transfect E. coli GX1210, and clones were screened for the presence of an 
insert fragment of the appropriate size. A clone containing the desired fragment was identified and 
designated mGX8434. 

A second Kpn I site was then created in the Protein G coding sequence carried on mGX8434, using the- 
techniques of oligbnucleotide-directed mutagenesis. An oligonucleotide was synthesized with the sequence 
5' -GTCAGTCTTAGGTAATGGGTACCCAGCTAAAATTTCATCTATCAG 

which is complementary to sequences carried on mGX8434 adjacent to those encoding domain B1 , but 
carries a six-nucleotide insertion which comprises a recognition site for endonuclease Kpn I. With single 
stranded DNA isolated from phage mGX8434 as template and the above defined oligonucleotide as primer, 
double stranded RF DNA was synthesized in vitro using standard methods. This DNA was used to transfect 
E. coli GX1210, and clones were screened for the presence of a second Kpn I site. A clone with the desired 
itructure was identified and designated mGX8441. 

DNA of mGX8441 contains two Kpn I sites, both created as described above, in such a manner that 
deletion of the sequences between them will create an in-frame fusion of sequences upstream from the first 
site and those downstream from the second. This deletion was accomplished by digesting RF DNA of 
mGX8441 with endonuclease Kpn I, phenol-extracting and ethanol-precipitating the digested DNA, then 
recircularizing the larger RF fragment by incubating in dilute solution with T4 DNA ligase under ligation 
conditions. The ligated DNA was used to transfect E. coli GX1210, and clones were screened for loss of the 
small Kpn I fragment. A clone with the desired structure was designated mGX8442. The structure of 
mGX8442~at the site of deletion was verified by DNA sequencing. It encodes a protein predicted to be the 
same as that encoded by pGX4582 from its N-terminus through amino acid residue Ala 38 of Protein G, 
followed by the sequence GlyTyrPro, encoded by the Kpn I recognition sequence, then the sequence 
LeuProLysThrAsp (preceding domain B1) of pGX4582, and the remainder of the coding sequence of 
pGX4582. 

In order to establish the coding sequence of mGX8442 in B. subtilis , the plasmid pGX4370 was used 
as a vector. This plasmid is similar to the plasmid pGX4312, which is described by P. Bryan et al . in U.S. 
Patent application 828,545 (filed February 12, 1986), except that it contains additional sequences derived 
from a B. amyloliquefaciens subtilisin-encoding gene. These sequences comprise a promoter, translation 
initiation~~sequences, and sequences encoding the secretion signal sequence, followed by a Bam HI 
recognition sequence. They are derived from pGX2134, which is described in Example IV. The vector also 
carries replication origins active in both B. subtilis and E. coli , and markers which can be selected in both 
organisms (kanamycin resistance in B. subtilis and ampicillin resistance in E. coli ). Plasmid pGX4370 DNA 
was digested with endonucleases Bam HI and Hind HI, phenol extracted and ethanol precipitated. Similarly, 
mGX8442 RF DNA was digested with endonucleases Bam HI and Hind III, phenol extracted and ethanol 
precipitated. The digested DNA preparations were mixed and incubated with T4 DNA ligase under ligation 
conditions. The ligated DNA was used to transform E. coli SK2267, and an ampicillin resistant transformant 
with the desired structure was identified by restriction analysis and designated pGX4595. Plasmid pGX4595 
DNA was then used to transform protoplasts of B. subtilis GX8008. A kanamycin resistant transformant, 
designated GX8446, was shown, by analysis similar to that outlined in Example IV above, to produce 
protein with the immunoglobulin binding activity of Protein G, and this protein could be detected in both the 
extracellular medium and the cell lysate fractions. 
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B. Deletion of Downstream Sequences 

The starting point for constructions from which downstream sequences were variously deleted was a 
derivative of mGX7872 (Example IV) into which a site for endonuclease Xho I was inserted by 
5 oligonucieotide-directed mutagenesis following sequences encoding domain B2 (see Figure 12). For this 
purpose, an oligonucleotide was synthesized with the sequence 
5' -CAGTTGGTGCATCACCTCGAGGAACCTCTGTAACC 

which is complementary to Protein G encoding sequences on mGX7872 immediately downstream from 
those encoding domain B2, but which contains a three-nucieotide insertion which creates an Xho I site. With 
70 single stranded DNA isolated from mGX7872 as template, and this oligonucleotide as primer, double 
stranded RF DNA was synthesized in vitro by standard methods, and used to transfect E. coli GX1210. 
Clones were screened for the presence of an Xho I site, and one with the desired structure was designated 
mGX7875. The altered sequence in mGX7875~was verified by DNA sequencing and shown to be as 
desired. 

75 A Kpn I site was then inserted into mGX7875 close to sequences encoding domain B1 , at the position 
analogoui" to the second Kpn i site inserted into mGX8434, which is described above. The same 
oligonucleotide, and methods~analogous to those described above (for construction of mGX8441) were 
used. A clone with the desired structure was identified and designated mGX8447. 

Downstream sequences derived from mGX8447 were then used to replace those present in pGX4595. 

20 For this purpose, RF DNA of mGX8447 and DNA of pGX4595 were separately digested with endonucleases 
Kpn I and Hind III, phenol extracted and ethanol precipitated. The digested DNA preparations were mixed 
Ind incubatecTwith T4 DNA iigase under ligation conditions. The ligated DNA was used to transform E. coli 
SK2267, and an ampicillin resistant transformant carrying a plasmid with the desired structure was identified 
by restriction analysis, and designated pGX4596. This plasmid is similar to pGX4595, except that 

25 sequences encoding the active portion of Protein G and downstream sequences (from the Kpn I site 
preceding B1 to the Hind Hi site following the Oterminal coding sequences) are derived from mGX8447. 

Plasmid pGX4596 was then used as starting point for various deletions involving downstream se- 
quences. First, pGX4596 DNA was digested with endonuclease Xho I, which cuts at the site created by 
oligonucieotide-directed mutagenesis, described above, near the downstream end of sequences encoding 

30 domain B2, and Hind III, which cuts downstream from the end of the coding sequence. This digested DNA 
was subjected to preparative electrophoretic fractionation on a 0.7% agarose gel, and the larger of the two 
resulting fragments was eluted from the gel and recovered. A double stranded oligonucleotide adapter was 
constructed with the following structure: 

35 5'P-TCGATCGTGCTAA 

AGCACGATTTCGA-5'P 

The purified large Xho l-Hind 111 fragment derived from pGX4596 was ^circularized in the presence of 
4Q this adapter, the single~stranded ends of which are complementary to the single stranded ends of the large 
fragment This was accomplished by incubation of the fragment and adapter together in the presence of T4 
DNA Iigase under ligation conditions. The ligated DNA was used to transform E. coli SK2267, and an 
ampicillin resistant transformant carrying a plasmid with the desired structure was identified by screening 
for the presence of the endonuclease Pvu I site which is present in the adapter sequence. This plasmid was 
„ designated pGX4597. It carries a truncated Protein G gene with the following designed structure 

45 

• . • [B2]MetValThrGluValProArgSerCysEND 

. . . ATGGTTACAGAGGTTCCTCGATCGTGCTAAAGCTT . . . 

50 

The C-terminal Cys residue encoded by the synthetic adapter in this construction is unique in the Protein G 
sequence encoded by this gene. One skilled in the relevant art will recognize that such a unique residue 
can provide a favorable site to which chemical reactions can be directed, such as those designed to 
immobilize the protein on a solid matrix or to attach various chemical functionalities or other proteins to it. 
55 A second downstream deletion encompassed sequences downstream from repeat C5. This construction 
made use of an endonuclease Xmn I site derived from the sequence at residue 1815 of the DNA sequence 
shown in Figure 8. Plasmid pGX4596 contains a total of four Xmn I sites. In order to obtain molecules 
cleaved only at the desired site in the Protein G coding sequence, pGX4596 DNA (5 micrograms per ml) 
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was subjected to partial digestion with Xmn I (100 units per ml) in the presence of ethidium bromide (40 
micrograms per ml), for 60 min at 37 C. Under these conditions, digestion is mainly limited to a single 
cleavage, which results in a mixed population of linear molecules generated by cleavage at any one of the 
four sites. This digested DNA was extracted with phenol/chloroform/isoamyl alcohol (25:24:1) and precipitat- 
ed with ethanol, removing both the ethidium bromide and Xmn I enzyme. The mixed linear DNA was then 
digested with Hind III. The Hind HI site is located closer to the Xmn I site in the Protein Q coding sequence 
than to any oflhe other XmrTl sites. The largest Hind Ill-Xmn I fragment generated by digestion of the 
mixed Xmn I linear population with Hind III is therefore derived from pGX4596 molecules which were cut 
only atlhe Xmn I site in the Protein G coding sequence. This fragment was purified by preparative 
electrophoreticlractionation on a 0.7% agarose gel. The band with lowest mobility was excised, eluted from 
the gel, and recovered. 

A double stranded oligonucleotide adapter was constructed with the structure 



5'P-TGCCGGCTA 

ACGGCCGATTCGA- 5 ' P 

The single stranded end of this oligonucleotide is complementary to the single stranded end of the 
pGX4596 fragment, generated by Hind III cleavage. The purified fragment was recirculated in the 
presence of this adapter by incubation with T4 DNA ligase under ligation conditions. The ligated DNA was 
used to transform E. coli SK2267, and an ampicillin resistant transformant containing a plasmid with the 
desired structure wis identified by restriction analysis. This plasmid was designated pGX4599. It carries a 
truncated Protein G gene with the following designed structure: 

. . . [C5]A1 aGI uThrAl aGlyEND 

. . . GCTGAAACTGCCGGCTAAGCTT . . . 



Plasmids pGX4597 and pGX4599 were used separately to transform protoplasts of B. subtilis GX8008. 
Kanamycin resistant transformants were selected and designated GX8455 (pGX4597) and GX8457 
(pGX4599). Both strains were found to synthesize protein with the immunoglobulin binding activity of 
Protein G, and in both cases this protein was detected in both the extracellular medium and the cell lysate 
fractions. 



EXAMPLE VII 



Construction of a Preferred Form of the Protein G Gene and Expression in E. coli of the Protein G 
Variant Type 1 



Derivatives of the Protein G gene with various deletions were fused to a regulable promoter system 
for advantageous expression in E. coli . The promoter used was a hybrid bacteriophage lambda promoter 
(OL/PR) constructed by in vitro methods, and described by McKenney et al . in U.S. Patent application 
534,982 (filed September237T983). This promoter is carried on a plasmid pGX2606, which includes the 
translation initiation site of the phage lambda cro gene, with a Ba mHI site, situated as follows: 

BamHI 

. TTGACTATTTTACCTCTGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGTATGGATCC . • . 
o 5 _io Translation 



In order to permit sequences encoding the "B-domains" derived from phage mGX7877 (Example IV) to 
be fused to the E. coli expression vector pGX2606, a Bam HI site was inserted in the correct frame in 
mGX7877 DNA ~by~~oligonucleotide-directed in vitro mutagenesis (see Figure 14). The following 
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oligonucleotide was synthesized: 

5 ' -pGTCAGTCTTAGGTAATGCAGGATCCGCTAAAATTTCATCTATCA6-3 ' 

BamHI 

The sequence of this oligonucleotide is complementary to those in mGX7877 encoding the region 
between domains A2 and B1, but contains a 6-nucleotide insertion which constitutes a recognition site for 
10 endonuciease Bam HI, in the correct frame for subsequent fusion to the expression signals in pGX2606. 

This oligonucleotide was used as primer to convert single-stranded mGX7877 DNA to duplex DNA in 
vitro . The resulting duplex DNA was used to transfect E coli GX1210. The RF DNA recovered from cells 
inficted from plaques was screened for the presence of the Bam HI site. One with the desired structure 
was designated mGX7893. 

Double-stranded RF DNA of mGX7893 was digested with endonucleases Bam HI and Hind ill, and the 
digested DNA was fractionated by agarose gel (1%) electrophoresis. The smaller of the two linear DNA 
bands produced, which contains sequences encoding the B1 and B2 domains of protein G, was recovered 
by elution from the gel. DNA of plasmid pGX2606 was digested with endonucleases Bam HI and Hind III, 
and the longer linear fragment recovered by phenol extraction and ethanol precipitation. The purified 
mGX7893 fragment was mixed with the recovered pPGX2606 DNA and incubated with T4 DNA iigase under 
ligation conditions. Ligated DNA was used to transform E. coli GX1201, selecting for ampicillin resistance at 
30° C. Transformants were screened for production of IgG binding protein at 42° C, as described in 
Example VII. One positive transformant was designated GX8436, and found to contain plasmid pGX4592, 
with the desired structure. 

The N-terminai coding sequence of pGX4592, verified directly by DNA sequencing, is as follows: 

fMet Asp Pro Ala Leu Pro Lys Thr Asp [ Bl] ... 
. . .AGGAGGTTGT ATG GAT CCT GCA TTA CCT AAG ACT GAC ACT TAC... 

30 

The DNA sequence of the entire gene which encodes this Protein G variant is as follows: 

1$ ZZ 30 A* 50 

35 ATGGATCCTG CATTACCTAA GACTGACACT TACAAATTAA TCCTTAATGG ™??*"IT G 

AAPGGCGAAA CAACTACTGA AGCTGTTGAT GCTGCTACTG CAGAAAAAGT CTTCAAACAA 
TACGCTAACG ACAACGGTGT TGACGGTGAA T6GACTTACG ACGATGCGAC 
ACAGTTACTG AAAAACCAGA AGTGATCGAT GCGTCTGAAT TAACACCAGC CGTGACAACT 
TACAAACTTG TTATTAATGG TAAAACATTG AAAGGCGAAA CAACTACTAA AGCAGTAGAC 

31 ft 220 330 3 AC 3S0 3&0 

GCAGAAACTG CAGAAAAAGC CTTCAAACAA TACGCTAACG ACAACGGTGT TGATGGTGTT 

TSGACTTATG ATGATGCGAC TAAGACCTTT ACGGTAACTG AAATGGTTAC AGAGGTTCCG 
GTCGCTTCAA AACQTAAAGA AGACTAA 
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40 



45 



50 



Under appropriate fermentation conditions, with expression induced at 42° C, continued at 39° C, strain 
GX8436 was shown to produce a protein of the expected size with binding activity for human IgG. This 
protein, which comprises the binding domains B1 and B2, has been designated protein G variant Type 1. 
The amino acid sequence of this variant is as follows: 

5 10 15 20 2S 30 

IMDPALPKTDTYKL I LNGKTLKGETTTEAVD 
31 AATAEKVFKQYAtJDNG V-D GEWTYDDATKTF 
61 TVTEKPEV X DASELTP A'V TTYKLVINGKTL 
91KGETTTKAVDAETAEKAFKQYANDNGVDGV 
121 WTYDDATKTFTVTEMVTEVPVASKRKED, 



55 



EXAMPLE VIII 
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Construction of Preferred Forms of the Protein G Gene for Expression in E. coli to Give Protein G 

Variants Type 2 and 3 



Derivatives of the Protein G gene with various deletions were fused to a reguiatable promoter system 
for advantageous expression in E. coli . The promoter used was a hybrid bacteriophage lambda promoter 
w (OL/PR) constructed by in vitro Methods, and described by McKenney et a! . in U.S. Patent application 
534,982 (filed September, 1983). This promoter is carried on a plasmid pGX2606, which includes the 
transla tion initiation site of the phage lambda cro gene, with a Ba mHI site, situated as follows: 

BamHI 

75 ... TTGACTATTTTACCTCTGGCGGTGATAAT6GTTGCATGTACTAAGGAGGTTGTATGGATCC. . . 
.35 -10 Translation 



In order to fuse modified Protein G genes derived from those carried on pGX4597 and pGX4599 at the Ba 
mHI site of the vector pGX2606, Ba mHI sites were created in pGX4597 and pGX4599, at the unique Kpn I 
sites of those plasmids. For this purpose, a self-complementary oligonucleotide linker was synthesized with 
the following structure: 

5'P-GGATCCGTAC 
CATGCCTAGG-5'P 



The single stranded ends of this double stranded linker are complementary to the single stranded ends 
30 generated by digestion of the plasmids with endonuclease Kpn I, and the linker incorporates a recognition 
site for endonuclease Ba mHI. 

DNA of plasmids ~pGX4597 and pGX4599 were separately digested with endonuclease Kpn I, phenol 
extracted and ethanol precipitated (see Figure 13). The digested DNA preparations were then incubated 
with the phosphorylated linker oligonucleotide and T4 DNA ligase under ligation conditions. DNA ligase was 
35 inactivated by incubation at 70 °C for 3 min, and the ligated DNA preparations were then digested with 
endonucieases Ba mHI and Hin dill. Each digested DNA preparation was then subjected to preparative 
electrophoretic fractionation oTTa 1% agarose gel. The fragment of mobility corresponding to a length of 
between 400 and 700 bp was excised, extracted from the gel, and recovered. DNA of plasmid pGX2606 
was digested with endonucieases Ba mHI and Hin dill, and subjected to preparative electrophoresis on a 
40 o.7% agarose gel to remove the small Ba mHI-Hin dill fragment. The larger band was excised, eluted, and 
recovered. 

The recovered pGX2606 fragment was mixed separately with the recovered pGX4597 and pGX4599 
fragments, and incubated with T4 DNA ligase under ligation conditions. The ligated DNA preparations were 
used separately to transform E. coli GX1201 (nadA::Tn10 delta 4(chlD-blu) (lambda c!857 delta Ba mHI)), a 

*5 phage lambda lysogen carrying the c!857 gene encoding a thermolabile repressor protein. Transformants 
were selected at 30 *C for ampicillin resistance and screened by a variation of the immunoassay procedure 
described in Example I. Cells were grown at 30° C on a cellulose acetate filter atop a nitrocellulose filter on 
a standard nutrient agar plate. After visible colonies had appeared, the plates were incubated at 42 C for 4- 
6 hours, then the nitrocellulose filter was developed as described in Example I. A positive transformant 

so identified in this way was found to contain plasmid of the desired structure by restriction analysis. The 
plasmid containing pGX4597-derived sequences was designated pGX5203, and the E. coli strain containing 
this plasmid was designated GX8464. The plasmid containing pGX4599-derived sequences was designated 
pGX5204, and the E. coli strain containing this plasmid was designated GX8465. 

The designed structures of the modified Protein G genes carried by pGX5203 and pGX5204 are the 

55 same at the N-terminus: 
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10 



fMetAspProTyrProLeuProLysThrAsp[ Bl ]. . . 
. • .AAG6AGGTTGTATGGATCCGTACCCATTACCTAAGACTGACACTTAC. . . 

The C-terminal sequences are the same as those of pGX4597 and pGX4599, respectively (Example VI). 

Both GX8464 and GX8465 were shown to produce protein variants with the immunoglobulin binding 
activity of Protein G. In addition, both strains exhibited enhanced production and produced about 200 mg of 
recovered protein-G like material per liter of bacterial culture. Synthesis of this protein, was found to be 
repressed at 30° C, and induced at 42° C, the mode of regulation expected of a gene under control of the 
hybrid phage lambda promoter OL/PR in the presence of the thermolabile repressor encoded by the c!857 
gene. 

The amino acid sequence of the protein G variant comprising the B1 and B2 binding domains (Type 2) 
produced by GX8465 is as follows: 

5 lO 15 20 25 33 

75 1MDPYPLPKTDTYKL I LNGKTLKGETTTEAV 

aiOftATftEKVFKQYPNDNBVOGEWTYOOATKT 
ti FTVTEKPEV I D A S E L T PAVTTYKLV X NGKT 
91LKGETTTKAVDAETAEKAFKQYANDNGVDG 
121 VWTYDDATKTFTVTEMVTEVPRGDAPTEPE 
151 K P E A S I PLVPLTPATPI AKDDAKKDDTKKE 
20 leiDAKKPEAKKDDAKKAETAG. 

The DNA sequence of the gene which encodes this Protein G variant (type 2) is as follows: 



25 ic 8C 30 40 50 60 

ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGTCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
TTTACAGTTA CTGAAAAACC AGAAGTGATC GATGCGTCTG ^TTAACRCX ^CGTGACft 
ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TARAGCAGTA 

310 320 330 340 350 360 

GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG JGTTGATGGT 
GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATGGT TACAGAGGTT 
CCTCGAGGTG ATGCACCAAC TGAACCAGAA AAACCAGAAG CAAGTATCCC JCTTGTTCCG 
TTAACTCCTG CAACTCCAAT TBCTAAAGAT GACGCTAAGA AAGACGATAC JAAGAAAGAA 
GATGCTAAAA AACCAGAAGC TAAGAAAGAT GACGCTAAGA AAGCTGAAAC TGCCGGCTAA 



30 



35 



40 



The amino acid sequence of the protein G variant Type 3 produced by GX8464 which comprises the 
B1 and B2 binding domains is as follows: 

- 5 10 15 23 25 3C 

iMDPYPLPKTDTYKL I LNGKTLKGETTTEAV 
31 DAATAEKVFKOYANDNGVDGEWTYDDATKT 
61 FTVTEKPEVI 0ABELTPAVTTYKLV1NGKT 
91LKGETTTKAVDAETAEKAFKQYANDN6VDG 
45 121VWTYDDATKT. FTVTEMVT'EVPRSC 

The DNA sequence of the gene which encodes this Protein G variant (type 3) is as follows: 

pA 30 40 SO 60 

50 ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA 2££g*J 

TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA gSTCTTCAAA 

s Si Si Si 3b 

fpot 330 340 350 360 

55 S5SS sassss sssss sssss SBSSS S3KK 

CCTCGATCGT GCTAA 
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5 Construction of the Protein G Gene and Expression in E. coll to Give Protein G Variant Type 5 

Piasmid pGX4599 was used to transform E coli GM272, a strain which lacks the dam methylase. 
Plasmid DNA obtained from one such transformant was digested with restriction endonuclease Cla I. 
w Because of the absence of dam -induced methylation in this DNA, digestion produced two fragments. 
These fragments were recovered by pheno! extraction and ethanol precipitation. 

A synthetic seif-complementary oligonucleotide adapter was constructed with the following sequence: 

75 5'-CGCCTGGATCCAGG-3 / 

3'-GGACCTAGGTCCGC-5' 



20 
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30 



35 



40 



45 



This double-stranded oligonucleotide has single-stranded ends complementary to those generated by 
endonuclease Cla I and contains a recognition sequence for endonuclease Bam HI. 

The s'-phosphorylated oligonucleotide adapter was mixed with the recovered pGX4599 DNA fragment. 
The mixture was incubated with T4 DNA ligase under ligation conditions at a DNA concentration of 
approximately 10 micrograms per ml. After ligation, the recircularized DNA was used to transform E. coli 
SK2267, selecting for ampicillin resistance. Only one of the two fragments contains a replication origin 
active in E. coli and the selectable ampicillin resistance marker, so only recircularized DNA containing this 
fragment canTansform E. coli . Transformants were screened by restriction analysis of plasmid DNA, and 
one containing plasmid withlhe desired structure (pGX5241) was identified. 

Plasmids pGX5241 and pGX2606 were both digested with endonucleases Hind ill and Bam HI. 
Digested DNA was recovered following phenol extraction and ethanol precipitation. The two digested 
plasmids were mixed in ligation buffer at approximately 50 micrograms DNA per ml and ligated in the 
presence of T4 DNA ligase. Ligated DNA was used to transform E. coli GX1201, selecting for ampicillin 
resistance at 30° C, and transformants were screened by restriction analysis of plasmid DNA. One 
containing plasmid with the desired structure (pGX5245) was designated strain GX8822. The correct 
structure was verified by DNA sequencing. The DNA sequence of the gene which encodes this Protein G 
variant is as follows: 

«« £0 30 40 30 60 

PTGGPTCCPG GCGPTGCGTC TGPPTTPPCP CCPGCCGTGP CPPCTTPCPP «CTTGTTPTT 
PPTGGTPPPP CPTTGPPPGG CGPPPCPPCT PCTPPPGCPG TPGPCGCPGP £«CTGCPGPP 
PPPGCCTTCP PPCPPTPCGC TPPCGPCPPC GGTGTTGPTG GTGTTTGGPC TTPTGPTGPT 

gSgpcSSgS cctttpcggt ppctgppptg gttpcpgpgg ttcctcgpgg tgptgcpccp 

PCTGPPCCPG PPPPPCCPGP PGCPPGTPTC CCTCTTGTTC CGTTPPCTCC TGCPPCTCCP 

31 ft 320 330 340 350 360 

PTTGCTPPPG PTGPCGCTPP GPPPSACGPT PCTPPGPPPG PPGPTGCTPP PPPPCCPGPP 
GCTPPGPPPG PTGPCGCTPP GPPPGCTGPP PCTGCCGGCT RP 



Strain GX8822 was found to produce a protein of the expected size with the ability to bind to human 
IgG This protein G variant contains a single IgG binding sequence B2 (from GX7809 protein G), the 
adjacent proiine-rich region, and the "C-repeats." The predicted amino acid sequence for this Protein G 
so variant type 5 is as follows: 

5 10 15 80 £5 30 

1MDPGDPSELTPPVTTYKLVINGKTLKGETT 
■MTKPVDAETflEKPFKGYPNDNGVDGVWTYDD 
llPTKTFTVTEWVTEVPRGDPPTEPEK PEPSI 
55 91 PLVPLTPPTP I PKDDPKKDDTKKEDPKKPE 

lcl PKKDDPKKPETPG* 
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EXAMPLE X 



Construction of Protein G Genes Encoding Protein G Variants Types 6 and 11 



Plasmid pGX5204 DNA was digested with restriction endonuclease Pst I, and the long linear fragment 
was isolated following electrophoretic fractionation on a 1% agarose gel, and eiution of the purified fragment 

w from the gel. DNA of plasmid pGX4584 (which consists of the vector pGX1066 plus a DNA insert of 2.4 kbp 
containing the protein G gene isolated from Streptococcus GX7805 (see Example V), was subjected to 
partial digestion with Pst I, under conditions such that significant quantities of a 420 bp fragment derived 
from the region encoding the IgG-binding domains of protein G were present. This 420 bp Pst I fragment 
was isolated following agarose gel (1.5%) electrophoresis and eiution from the gel. 

75 The long Pst I fragment of pGX5204 and the 420 bp Pst I partial digest fragment from pGX4583 were 
mixed and ligited with T4 DNA ligase. The ligated DNA was used to transform E. coli GX1201, and 
transformants were screened by restriction analysis of plasmid DNA. Plasmids of two types were identified. 
One (pGX5247) was derived from the plasmid pGX5204 by substituting the 420 bp Pst I fragment of 
pGX4583 for the 210 bp fragment present in pGX5204. The strain containing pGX5247 was designated 

20 GX8825 and was shown to produce an IgG-binding protein of the expected size. The predicted structure of 
this protein (type 6) is as follows: 

1 MDPYPLPKTDTYKL I LNGKTLKGETTTEAV 
31 DAATAEKVFKGYANDNGVDGEWTYDDATKT 
61 FTVTEKPEV Z DASELTPAVTTYKLV X NGKT 
91 LKGETTTEAVDAATAEKVFKOYANDNGVDG 
121EWTYDDATKTFTVTEKPEV1DASELTPAVT 

leiQYPNONGVDGVWTVDDl.TKTFTVTEMVTnv 
211 PRGDAPTEPEKPEAS 1 P L V P L T P A T P I A K D 



The DNA sequence of the gene which encodes this Protein G variant (type 6) is as follows: 



ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGC6 AAACAACTAC TGAAGCTGTT 6ATGCTGCTA CTGCAGAAAA AGTCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
TTTACAGTTA CTGAAAAACC AGAAGTGATC GATGCGTCTG AATTAACACC AGCCGTGACA 
ACTTACAAAC TT6TTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

31© 320 33© 3*0 35© 360 

GACGCAGAAA CTGCAGAAAA AGTCTTCAAA CAATACGCTA ACGACAACGG TGTTGACGGT 
GAATGGACTT ACGACGATGC GACTAAGACC TTTACAGTTA CTGAAAAACC AGAAGTGATC 
GATGCGTCTG AATTAACACC AGCCGTGACA ACTTACAAAC TTGTTATTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGCCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGATGGT GTTT6GACTT ATGATGATGC GACTAAGACC 

6 1 ft «C 630 640 650 660 

TTTACGGTAA CTGAAATGGT TACAGAGGTT CCTCGAGGTG ATGCACCAAC TGAACCAGAA 
AAACCAGAAG CAAGTATCCC TCTTGTTCCG TTAACTCCTG CAACTCCAAT TGCTAAAGAT 
GACGCTAAGA AAGACGATAC TAAGAAAGAA GATGCTAAAA AACCAGAAGC TAAGAAAGAT 
GACGCTAAGA AAGCTGAAAC TGCCGGCTAA 



The sequence contains the B1, B1/B2, and B3 binding domains, the amino acid sequences of which are 
identical to those of protein G derived from Streptococcus GX7805, plus the adjacent proline-rich region 
and the "C-repeats" of protein G. 

The second type of plasmid (pGX5246) isolated from among the above transformants was derived from 
pGX5204 by recircularization of the long Pst I fragment without an insert. In this plasmid, therefore, the 
short 210 bp Pst I fragment of pGX520T1s deleted. The strain containing pGX5246 was designated 
GX8824. The predicted structure of the protein produced by GX8824 (type 11) is shown below: 
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The DNA sequence of the gene (type 11) which encodes this Protein G variant is as foliows: 



10 80 30 

ATGGATCCGT ftCCCftTTftCC TftftGftCTGftC 
TTGftflftGGCG ftftftCftftCTftC TGftftGCTGTT 
CftfiTftCGCTft ftCGftCftftCGG TGTTGftTGGT 
TTTflCGGTftft CTGftflftTGGT TftCftGftGGTT 
ftftftCCftGftftG CftfiGTftTCCC TCTTGTTCCG 

310 380 330 

GACGCTAAGA ftftGftCGftTftC TftftGAftftGftA 
GftCGCTftftGft ftftGCTGftftftC TGCCGGCTftft 



40 50 60 

ftCTTftCftftfiT TftftTCCTTftft TGGTAAAACA 
GftTGCTGCTft CTGCftGftftftft fiGCCTTCftftA 
GTTTGGftCTT ftTGftTGATGC GACTAAGACC 
CCTCGAGGTG ATGCACCAAC TGAACCAGAA 
TTAACTCCTG CAACTCCAAT TGCTAAAGAT 

340 350 3&C 

GATGCTftAAA AACCAGAAGC TAAGAAAGAT 



This sequence contains a single "B repeat" sequence which is a chimera of sequences derived from 
sequences B1 and B2 of GX7809 protein G. 



EXAMPLE XI 



Construction of a Protein G Gene Encoding Protein G Variant Type 7 



Plasmid pGX4595 was used to transform E. coli GM272, a strain which lacks the dam methylase. 
Plasmid DNA obtained from one such transformant was digested with restriction endonuciease Cla I. 
Because of the absence of dam -induced methylation in this DNA, digestion produced two fragments. 
These fragments were recovered by phenol extraction and ethanol precipitation. 

A synthetic self-complementary oligonucleotide adapter was constructed with the following sequence: 



5'-CGCCTGGATCCAGG-3' 
3'-GGACCTAGGTCCGC-5' 

This double-stranded oligonucleotide has single-stranded ends complementary to those generated by 
endonuciease Cla I, and contains a recognition sequence for endonuciease Bam HI. 

The 5-phoIphorylated oligonucleotide adapter was mixed with the recovered pGX4595 DNA fragment. 
The mixture was incubated with T4 DNA ligase under ligation conditions at a DNA concentration of 
approximately 10 micrograms per ml. After ligation, the recircularized DNA was used to transform E. coli 
SK2267, selecting for ampicillin resistance. Only one of the two fragments contains a replication origin 
active in E. coli and the selectable ampicillin resistance marker, so only recircularized DNA containing this 
fragment cafTFansform E. coli . Transformants were screened by restriction analysis of plasmid DNA, and 
one containing plasmid withlhe desired structure (pGX5240) was identified. 

Plasmids pGX5240 and pGX2606 were both digested with endonucleases Hind Hi and Bam HI. 
Digested DNA was recovered following phenol extraction and ethanol precipitation. The two digested 
plasmids were mixed in ligation buffer at approximately 50 micrograms DNA per ml and ligated in the 
presence of T4 DNA ligase. Ligated DNA was used to transform E. coli GX1201, selecting for ampicillin 
resistance at 30 e C, and transformants were screened by restriction analysis of plasmid DNA. One 
containing plasmid with the desired structure (pGX5244) was designated strain GX8821. The correct 
structure was verified by DNA sequencing. The DNA sequence of the gene which encodes this Protein G 
variant is as follows: 
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Strain GX8821 was found to produce a protein of the expected size with the ability -to bind to human 
lgG. The predicted structure of this protein is as follows: 

5 iz 15 2* 25 3C 

IMDPGDASELTPflVTTYKLVINGKTLK&EIT 

31 TKAVOAET A E KflPKOYPNDNGVDGVWTYDD 
61ATKTFTVTEMVTEVPVASKRKED, 

This structure contains a single lgG binding sequence B2 (from GX7809 protein -G) plus several amino 
acid residues derived from the C-terminus of the natural protein. 
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Construction of a Protein G Gene Encoding Protein G Variant Type 8 

DNA of plasmid pGX5204 was digested with endonuclease Hind III, and linear DNA was recovered by 
phenol extraction and ethanol precipitation. A self-complementary synthetic DNA linker was constructed with 
the following sequence: 

5 ' - AGCTTAGCATGAAGGCCTTCATGCTA-3 ' 

3 ' - ATCGTACTTCCGG AAGTACGATTCG A- 5 ' 

This double-stranded oligonucleotide has single-stranded ends complementary to those generated by 
endonuclease Hind III, plus a recognition sequence for endonuclease Stu I. Linearized pGX5204 DNA was 
mixed with the 5 -phosphorylated oligonucleotide in ligation buffer and incubated with T4 DNA ligase. The 
ligated DNA was used to transform E. coii GX1201 and transformants were screened by restriction analysis 
of plasmid DNA (for the presence ofThFStu I site). A transformant with the desired plasmid (pGX5254) was 
identified. 

DNA of pGX5254 was digested with endonuclease Nae I. Digestion was found to be incomplete, so 
linear molecules were purified from the digest by agarose gel (1%) electrophoresis, eluted from the gel, and 
recovered. The recovered linear molecules were then digested with Stu I, and the long linear fragment 
recovered by phenol extraction and ethanol precipitation. Both of these enzymes leave blunt ends. The 
linear DNA was then recircularized by incubation at approximately 10 micrograms per mi in ligation buffer 
with T4 DNA ligase. The desired effect on the DNA structure is as follows: 

„.Lys Lys Ala Glu Thr Ala Gly END < Linker > 

...AAG AAA GCT GAA ACT GCC GGC TAA GCTTAGCATGAAGGCCTTCATGCTAAGCTT, . . 



Nae I Hind I II Stu I Hind II I 

i 



...Lys Lys Ala Glu Thr Ala Pro Ser Cys END 
...AAG AAA GCT GAA ACT GCC CCT TCA TGC TAA GCTT. 
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An in-frame fusion is formed, the effect of which is to substitute a ProSerCys sequence at the C- 
terminus in place of the C-terminal Gly of the protein encoded by pGX5204. 

The recircularized DNA was used to transform E. coli GX1201, and transformants were screened by 
restriction analysis of plasmid DNA (for loss of the Nae I site). One with the desired structure (pGX5255) 
5 was designated strain GX8833. This strain was shown to produce an IgG-binding protein of the expected 
size, with approximately one Cys residue per molecule. The structure of the C-terminal coding region of 
pGX5255 was verified directly by DNA sequencing. 

The DNA sequence of the gene which encodes this Protein G variant is as follows: - 

70 10 20 30 A0 50 60 

1 ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 

61 TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCA6AAAA AGTCTTCAAA 

121 CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 

181 TTTACAGTTA CTGAAAAACC AGAAGTGATC GATGCGTCTG AATTAACACC AGCCGTGACA 

241 ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

15 3iq 330 330 340 350 360 

301 GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG TGTTGATGGT 
361 GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATGGT TACAGAGGTT 
421 CCTCGAGGTG ATGCACCAAC TGAACCAGAA AAACCAGAA6 CAAGTATCCC TCTTGTTCCG 
481 TTAACTCCTG CAACTCCAAT TGCTAAA6AT GACGCTAAGA AAGACGATAC TAAGAAAGAA 
541 GATGCTAAAA AACCAGAAGC TAAGAAAGAT GACGCTAAGA AAGCTGAAAC TGCCCCTTCA 



20 



601 TGCTAA 



The predicted amino acid sequence of the Protein G variant comprising the B1 and B2 domains, 
25 expressed by this gene, is as follows: 



30 













5 








10 








1 


5 








20 






25 








30 


\ 


M 


D 


P 


Y 


P 


L 


P 


K 


T D 


T 


Y 


K 


L 


I 


L 


N 


G 


K T 


L K 


G 


E T 


T 


T 


E 


A V 


31 


D 


A 


A 


T 


A 


£ 


K 


V 


F K 


Q 


Y 


A 


N 


D 


N 


G 


V 


D G 


E W 


T 


Y D 


D 


A 


T 


K T 


61 


F 


T 


V 


T 


E 


K 


P 


E 


V I 


D 


A 


S 


E 


L 


T 


P 


A 


V T 


T Y 


K 


L V 


I 


N 


G 


K T 


91 


L 


K 


G 


E 


T 


T 


T 


K 


A V 


D 


A 


E 


T 


A 


E 


K 


A 


F K 


Q Y 


A 


N D 


N 


G 


V 


D G 


121 


V 


W 


T 


Y 


D 


D 


A 


T 


K T 


F 


T 


V 


T 


E 


M 


V 


T 


E V 


P R 


G 


0 A 


P 


T 


E 


P E 


151 


K 


P 


E 


A 


S 


1 


P 


L 


V P 


L 


T 


P 


A 


T 


P 


I 


A 


K D 


D A 


K 


K D 


D 


T 


K 


K E 


181 


0 


A 


K 


K 


P 


E 


A 


K 


K D 


D 


A 


K 


K 


A 


E 


T 


A 


P S 


C. 















35 



40 



EXAMPLE XIII 



Construction of a Protein G Gene Encoding Protein G Variant Type 9 



DNA of plasmid pGX5204 was digested with endonuclease Hind III, and linear DNA was recovered by 
45 phenol extraction and ethanol precipitation. A double-stranded synthetic DNA linker was constructed with 
the following sequence: 

5' - AGCTGTTAACCAGCTGCTA-3 ' 
so 3 ' -CAATTGGTCGACGATTCGA-5 ' 

This double-stranded oligonucleotide has single-stranded ends complementary to those generated by 
endonuclease Hind III, plus recognition sequences for endonucleases Hpa I and Pvu II. Linearized pGX5204 
DNA was mixecTwith the 5'-phosphorylated oligonucleotide in ligation buffer and incubated with T4 DNA 
ligase. The ligated DNA was used to transform E. coli GX1201 and transformants were screened by 
restriction analysis of plasmid DNA (for the presence of the Pvu II site). The linker can be ligated to the 
linearized plasmid DNA in either of two orientations. Therefore, several transformants containing plasmids 
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which had acquired the linker ( Pvu II site) were identified, some of which were expected to contain the 
linker in the desired orientation (eTg7pGX5267). 

DNA of pGX5267 was digested with endonuclease Hpa I, and the linear DNA recovered by phenol 
extraction and ethanol precipitation. The linear DNA was then recircularized by incubation at approximately 
10 micrograms per ml in ligation buffer with T4 DNA ligase. The desired effect on the DNA structure is as 
follows: 



... Val Pro Leu Thr Pro ... Ala Gly END < LINKER-— —> 

. . . GTT CCG TTA ACT CCT ... GCC GGC TAA GCTGTTAACCAGCTGCTAAGCTT . . . 



Hoa l Hfial EYuII 

i 

. . . Val Pro Leu Thr Ser Cys END 

. . . GTT CCG TTA ACC AGC TGC TAA GCTT . . . 



Pvu 1 1 Hjndlll 



An in-frame fusion is formed, the effect of which is to detect the C-repeats present in the GX5204- 
encoded protein, and to substitute a SerCys sequence at the C-terminus. This creates a Cys residue which 
is unique in the protein. Note that if the linker were present in the opposite orientation, deletion between the 
two Hpa I sites would remove the Pvu II site and the Hind III site. 

The recircularized DNA was used to transform E. coli GX1201, and transformants were screened by 
restriction analysis of plasmid DNA (for shortening of the Bam Hl-Hind III fragment which contains the 
protein G coding sequence). Several such deleted plasmids were identified. In order to determine which 
had originally acquired the DNA linker in the desired orientation, the plasmids were screened for the 
presence of the Hind III and Pvu II sites. One with the desired structure (pGX5268) was designated strain 
GX8846. The DNA~iequence ofthe gene which encodes this Protein G variant is as follows: 

10 20 30 AG 50 6& 

1 ATGGATCCGT ACCCATTACC TAAGACT6AC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
61 TTGAAAGGCG AA AC A ACT AC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA A6TCTTCAAA 
121 CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
161 TTTACAGTTA CTGAAAAACC AGAAGTGATC GATGCGTCTG AATTAACACC AGCCGTGACA 
241 ACTTACAAAC TTGTTATTAA TGGTAAAACA TTGAAAGGCG AAACAACTAC TAAAGCAGTA 

310 320 330 340 350 360 

301 GACGCAGAAA CTGCAGAAAA AGCCTTCAAA CAATACGCTA ACGACAACGG TGTTGATGGT 

361 GTTTGGACTT ATGATGATGC GACTAAGACC TTTACGGTAA CTGAAATG6T TACAGAGGTT 

421 CCTCGAGGTG ATGCACCAAC TGAACCAGAA AAACCAGAAG CAAGTATCCC TCTTGTTCCG 
481 TTAACCAGCT GCTAA 

The predicted amino acid sequence of the Protein G variant expressed by this strain, comprising the B1 
and B2 binding domains, is as follows: 



S 10 15 20 * 25 30 

1 HDPYPLPKTDTYKL ILNGKTLKGEfTTEAV 
31DPATPEKVFKQYANDNGVDGEWTYDDATKT 
eiFTVTEKPEVXDASELTPAVTTYKLVlNGKT 
91LKGETTTKAVDAETAEKAFK0YANDNGVD6 
121VWTYDDATKTFTVTEMVTEVPRGDGPTEPE 
SS1KPEA6XPLVPLT6C 



EXAMPLE XIV 
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Construction of a Protein G Gene Encoding Protein G Variant Type 10 

Plasmid pGX4599 was used to transform E. coli GM272, a strain which lacks the dam methylase. 
Plasmid DNA obtained from one such transformFnt was partially digested with restriction endonuc ease Cla 
I under conditions where significant quantities of full-length linear DNA were formed There ^are two Cla , I 
sites in pGX4599. Because of the absence of dam -induced methylat.on in this DNA, both of these sites are 
available for cleavage by Cla I. The once-cutTirTear DNA is therefore of two types. This linear DNA was 
purified by agarose gel elictrophoretic fractionation and elution from the gel, followed by digestion w.th 
endonuclease Xho I. The longest fragment so generated was again purified by agarose gel electrophoresis, 

eluted and was then recovered. 

A double-stranded synthetic oligonucleotide adapter was constructed with the followmg sequence: 



20 



40 



5'-CGACGTCCC-3' 
3 ' -T6CAGGGAGCT- 5 ' 



This oligonucleotide has single-stranded ends complementary to those generated by endonucleases 
Cla I and Xho I, respectively, and a recognition site for endonuclease Aat II. 

— The pTirified Cla l-Xho i fragment from pGX4599 was mixed with the 5 -phosphorylated adapter 
oligonucleotide in Ration-buffer and incubated with T4 DNA ligase. The ligated I DNA jvas ^ tomtorm 
25 E coli DHSalpha (F-, endA1, hsdR17, supE44, thi-1, recAL gyrA96, relA1, delta (argF-lacZYA)U169 
pni80dlacZdeltaM15; available from Bethesda Research Laboratories, Inc.), and ^ormants were 
screened by restriction analysis of plasmid DNA (for decrease in the size of the Bam Hl-Hmd I II fragmen 
containing the protein G coding sequences). One containing a plasmid with the desired structure ( P GX5265) 

30 W ^JofnTng e t d he Cla I and Xho I ends of the linear molecule through the synthetic adapter oligonucleotide 
produces an in-frame fusion: 

... Glu Val He Asp Ala ... Val Pro Arg Gly Asp ... 
35 . . . GAA GTG ATC GAT GCG ... GTT CCT CGA GGT GAT ... 

Cla l Xhol 
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... Glu Val lie Asp Val Pro Arg Gly Asp ... 
... GAA GTG ATC GAC GTC CCT CGA GGT GAT ... 

Aat 1 1 

In order to fuse the modified Protein G gene carried on P GX5265 at the Bam HI site of the vector 
DGX2606 a Bam HI site was created in pGX5265 at the unique Kpn I sites of that plasm.d. For this 
purpose, a seiFcQ-mplementary oligonucleotide linker was synthesized with the following structure: 

5'P-GGATCCGTAC 
CATGCCTAGG-5'P 

The single-stranded ends of this double-stranded linker are complementary to the single -^nde d ends 
generated by digestion of the plasmid with endonuclease Kpn I, and the linker incorporates a recognifon 
site for endonuclease Bam HI. 
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DNA of plasmid pGX5265 was digested with endonuclease Kpn I, phenol extracted and ethanol 
precipitated. The digested DNA preparation was then incubated with the phosphorylated linker 
oligonucleotide and T4 DNA ligase under ligation conditions. DNA iigase was inactivated by incubation at 
70 °C for 5 min, and the ligated DNA preparation was then digested with endonucleases Bam HI and Hind 

5 III. The digested DNA preparation was then subjected to preparative electrophoretic fractionation on a 1.4% 
agarose gel. Two DNA fragments were observed on the ethidium bromide-stained gel, and the fragment of 
greater mobility was excised, extracted from the gel, and recovered. DNA of plasmid pGX2606 was 
digested with endonucleases Bam HI and Hind HI, and the large linear fragment recovered by phenol 
extraction and ethanol precipitation. 

w The recovered pGX2606 fragment was mixed with the recovered pGX5265 fragment, and incubated 
with T4 DNA ligase under ligation conditions. The ligated DNA preparation was used to transform E. coli 
GX1201. Transformants were selected at 30* C for ampicillin resistance and screened by restriction analysis 
of plasmid DNA (for the correctly sized Bam Hl-Hind III fragment). One was found to contain plasmid of the 
desired structure, pGX5266. This strain was designated GX8844. 

15 The DNA sequence of the gene which encodes this Protein G variant is as follows: 

10 BZ 30 40 50 6© 

ATGGATCCGT ACCCATTACC TAAGACTGAC ACTTACAAAT TAATCCTTAA TGGTAAAACA 
TTGAAAGGCG AAACAACTAC TGAAGCTGTT GATGCTGCTA CTGCAGAAAA AGTCTTCAAA 
CAATACGCTA ACGACAACGG TGTTGACGGT GAATGGACTT ACGACGATGC GACTAAGACC 
TTTACAGTTA CTGAAAAACC AGAAGTGATC GACGTCCCTC GAGGTGATGC ACCAACTGAA 
CCAGAAAAAC CA6AAGCAAG TATCCCTCTT GTTCCGTTAA CTCCTGCAAC TCCAATTGCT 

3X0 330 330 3 AO 350 360 

AAAGATGACG CTAA6AAAGA CGATACTAAG AAAGAAGATG CTAAAAAACC AGAAGCTAAG 
25 AOAGPTGACG CTAAGAAAGC TGAAACTGCC GGCTAA 

The predicted amino acid sequence of the Protein G variant expressed by this strain, comprising the 
single domain B1, is as follows: 
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1 HOPYPLPKTOTYKL I LNGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEWTYDDATKT 
61FTVTEKPEVI OVPRGOAPTEPEKPEASI PL 
91 VPUTPATP IAKDDAKKDDTKKEDAKKPEAK 
121 KDDAKKAETA'G* 



EXAMPLE XV 



Immobilization of Single Domain Protein G Variants on Tresyl-Activated Sepharose 4B 

45 

The freeze-dried powder (Pharmacia) of Tresyl-activated Sepharose 4B was suspended in 1 mM HCI. 
After washing the gel on a sintered glass filter with 1 mM HCI, it was sucked dry by aspiration. Protein G 
variants were dissolved in 0.1 M NaHC0 3 containing 0.5 M NaCI at pH 8.0 to a concentration of 5 mg/ml. 
About 1 g of the damp gel was added to about 1 ml of the protein solution. The suspension was then 
50 rotated end-over-end at 4°C or room temperature overnight. Thereafter, the gel was treated with a 0.1 M 
Tris buffer pH 8.0 for a few hours. The gel was then washed with the coupling buffer and three cycles of a 
pH 4 buffer (0.1 M sodium acetate, 0.5 M NaCI) and a pH 8 buffer (0.1 M Tris, 0.5 M NaCI). The gel^was 
then stored in phosphate buffered saline (50 mM phosphate, 0.15 M NaCI, 0.005% NaN 3 pH 7-7.4) at 4° C. 
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EXAMPLE XVI 
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Characterization of the Immobilized Protein G Variants 



10 



15 



Three single domain Protein G variants-Type 5 (B2), Type 10 (B1) and Type 11 (B1/B2)-were 
characterized. Typically, the immobilized proteins from Example XV were packed in medium pressure glass 
chromatography columns and connected to an HPLC system. 

The extent of immobilization was determined by the difference in absorbance at 280 nm before and 
after immobilization. Interfering elements in the reaction were removed by dialysis. Alternatively, any 
standard protein quantitation assay (e.g., BCA, Lowry or Coomassie blue) could be used to determine the 
extent of protein immobilization. The coupling and capacity results are summarized in Table 1. To 
determine IgG binding capacity, human serum was applied onto the column. After the column was washed 
extensively with PBS, bound IgG was eluted with 5 mM ammonium acetate (pH 5), 1.0 M acetic acid and 
equilibrated with PBS at the end. The amount of IgG eluted was quantitated by absorbance at 280 nm using 
an extinction coefficient of 1 .4. 

Table 1 



20 



Types 


Protein Q 


Coupling 


Human IgG 


coupled/mi gel 


efficiency 


bound/ml gel 


5 


3.57 mg 


75% 


16.6 mg 


10 


4.04 mg 


78% 


10.1 mg 


11 


3.84 mg 


81% 


11.12mg 
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The binding characteristics of the immobilized variants toward F c , F ab , F(ab ) 2 and IgG were then 
examined. After the columns were equilibrated with PBS, IgG or its proteolytic fragments or mixtures 
thereof were applied to the column. Extensive washing with PBS was then carried out. The column was then 
eluted with 5 mM ammonium acetate pH 5, 0.5 M ammonium acetate pH 3, 1.0 M acetic acid and finally 
equilibrated with PBS pH 7.0. The amount of protein was quantitated by measuring absorbance at 280 nm. 
The fractions were also analyzed by SDS-PAGE to ascertain the quality of the product. 

Fc fraqment (1.0 to 1.3 mg loaded) was applied to immobilized variants Types 5, 10 and 11. The 
immobilized Type 5 variant was in a 1.16 ml column and the Types 10 and 11 were in 0.75 ml columns. 
The results appear in Table 2: 

Table 2 
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The Binding of Fc Fragments to Immobilized Variants Under 
Different Conditions 




Type 5 


Type 10 


Type 1 1 


mg 1 


% 2 


mg 


% 


mg 


% 


Initial flow through 


0.17 


18 


0.05 


5 


0.11 


8 


pH = 5 


0.04 


4 


0.00 


0 


0.00 


0 


pH = 3 


0.13 


14 


0.31 


29 


0.34 


25 


1 .0 M Acetic acid 


0.35 


38 


0.39 


37 


0.41 


31 


PBS equilibration 


0.24 


26 


0.31 


29 


0.49 


36 



50 



1 Based on A280. 

Expressed as the percentage of the total material recovered. 



With Types 10 and 11, 100% of the Fc fragment loaded was recovered in the various f racoons. With 
55 Tvoe 5 93% of the Fc fragment loaded was recovered in the various fractions. 

The results for IgG binding (1.0 to 4.6 mg loaded) to immobilized variants Types 5 (1.16 ml column), 10 
and 11 (0.75 ml columns) appear in Table 3: 
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Table 3 



The Binding of igQ to immobilized Variants Under Different 
Conditions 




Type 5 


Type 1 0 


Type 1 1 


mg 1 


% 2 


mg 


% 


mg 


% 


Initial flow through 


0.00 


0 


0.00 


0 


0.49 


11 


pH = 5 


ND 3 


ND 


0.00 


0 


ND 


ND 


pH = 3 


0.02 


2 


0.00 


0 


1.62 


35 


1 .0 M Acetic acid 


0.40 


51 


0.16 


100 


1.87 


40 


PBS equilibration 


0.37 


47 


0.00 


0 


0.66 


14 



1 Based on A 2 so. 

Expressed as the percentage of the total material recovered. 
3 ND = Not Done 



20 With Type 5, 79% of the loaded IgG was recovered in the various fractions. With Type 10, 16% of the 
loaded IgG was recovered in the various fractions. With Type 11, 100% of the loaded IgG was recovered in 
the various fractions. 

The results for Fab binding (1.0 to 1.1 mg loaded) to immobilized variants Type 5 (1.16 ml column), 
Type 10 and Type 11 (0.75 ml columns) appear in Table 4: 

25 

Table 4 



The Binding of Fab Fragments to Immobilized Variants Under 
Different Conditions 




Type 5 


Type 10 


Type 1 1 


mg 1 


% 2 


mg 


% 


mg 


% 


Initial flow through 


0.57 


83 


0.29 


66 


0.59 


48 


pH = 5 


0.00 


0 


0.00 


0 


0.07 


14 


pH = 3 


0.00 


0 


0.00 


0 


0.00 


0 


1 .0 M Acetic acid 


0.00 


0 


0.00 


0 


0.00 


0 


PBS equilibration 


0.12 


17 


0.15 


34 


0.33 


38 



1 Based on A 2 so. 

Expressed as the percentage of the total material recovered. 



With Type 5, 69% of the loaded Fab was recovered in the various fractions. With Type 10, 44% of the 
45 loaded Fab was recovered in the various fractions. With Type 11, 87% of the loaded Fab was recovered in 
the various fractions. 

The results for F(ab') 2 binding (1.0 mg loaded) to immobilized variants Type 5 (1.16 ml column), Type 
10 and Type 11 (0.75 ml columns) appear in Table 5: 

50 



55 



36 



EP0 408 178 A1 



Table 5 



The Binding of F(ab') 2 to Immobilized Variants Under Different 
Conditions 




Type 5 


Type 10 


Type 1 1 


mg 1 


% 2 


mg 


% 


mg 


% 


Initial flow through 


0.32 


75 


0.18 


55 


0.42 


60 


pH = 5 


0.04 


9 


0.00 


0 


0.12 


6 


pH = 3 


0.00 


0 


0.00 


0 


0.00 


0 


1 .0 M Acetic acid 


0.00 


0 


0.00 


0 


0.00 


0 


PBS equilibration 


0.07 


16 


0.14 


45 


0.33 


34 



1 Based on A 2 so. 

Expressed as the percentage of the total material recovered. 



With Type 5, 43% of the loaded F(ab')2 was recovered in the various fractions. With Type 10, 31% of 
20 the loaded F(ab')2 was recovered in the various fractions. With Type 11, 86% of the loaded F(ab 1 )2 was 
recovered in the various fractions. 

As shown in Table 2, the binding of F c to these three immobilized variants is quite similar. However, 
there are notable differences in the way IgG interacts with these variants. Type 10 (B1) binds IgG very 
tightly and irreversibly under the elution conditions. Type 5 (B2) also binds IgG quite tightly but reversibly. 
25 With Type 11, the binding is moderate and shows the potential to fractionate IgG's into subclasses, since 
IgG's elute in different fractions. 

In all cases, the immobilized variants exhibited much weaker binding to Fab and F(ab )2 fragments in 
comparison to IgG and Fc fragments. Therefore, these immobilized variants are useful for separating IgG 
and Fc fragments from Fab and F(ab , ) 2 fragments. 
30 Since both immobilized Type 10 and Type 11 variants show some affinity for F ab fragments, the 
immobilized Type 5 variant stands out as a candidate for isolating F ab s. To illustrate the potential to purify 
F ab 's a mixture of F c , F(ab') 2 , IgG and fluorescein labeled F ab was applied to a Type 5 column. Almost all 
the fluorescence was recovered in the flow through. To verify that the lack of binding was not due to 
fluorescein modification, in a separate experiment, a mixture containing fluorescein-labeled Fc fragment was 
chromatographed on the column. In this case, the majority of the fluorescence bound to the column which 
showed that the fluorescein modification did not affect the binding characteristics of the column. 



35 



EXAMPLE XVII 

40 



Immobilization of a Cysteine Containing Variant on a Maleimide-Activated Sepharose Derivative 

45 

AH-Sepharose (Pharmacia) was swollen in water and washed with 0.5 M sodium chloride, water and 0.1 
M sodium phosphate pH 7.0. To the gel suspension, sulfosuccinimidyl 4-( p -maleimidopheny!) butyrate 
(Pierce) was added and stirred at room temperature for one hour. The gel was then washed with 0.1 M 
sodium phosphate pH 7.0 buffer. About 6 mg of the cysteine containing variant Type 3 was dissolved in the 

so pH 7 0 buffer (1.1 ml). The suction-dried activated gel was then added to the protein solution. This 
suspension was rotated end-to-end at room temperature overnight. At this point, excess reactive groups 
were blocked by adding Tris and /?-mercaptoethanol to the suspension and rotated for three hours at room 
temperature. After the incubation, the gel was washed with the coupling buffer and three cycles of a pH 4 
buffer (0.1 M acetate, 0.5 M sodium chloride) and a pH 8 buffer (0.1 M Tris, 0.5 M sodium chloride). The 

55 gel was stored in phosphate buffered saline at 4 C. 

The extent of immobilization was determined by the difference in absorbance at 280 nm before and 
after immobilization. Interfering elements in the reaction were removed by dialysis. Alternatively, any 
standard protein quantitation assay (e.g., BCA, Lowry or Coomassie blue) could be used to determine the 
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extent of protein immobilization. 

EXAMPLE XVIII 

5 



Determination of the Binding Capacity of Maleimide Immobilized Protein G Variants 

10 

After the immobilized protein was packed into a column (0.7 x 2.6 cm), it was washed with 1.0 M acetic 
acid and reequilibrated with PBS (pH 7.0). Either purified monoclonal antibodies, serum or other crude 
starting material was then applied onto the column. After the column was washed extensively with PBS, it 
was eluted with 5 mM ammonium acetate (pH 5), 1.0 M acetic acid and equilibrated with PBS at the end. 
15 The amount of protein was quantitated by absorbance at 280 nm using extinction coefficients of 1.4 and 
1 .49 for human and mouse antibodies, respectively. 

This one ml column contained 4.89 mg of Protein G variant Type 3 and bound 29.5 mg of human IgG 
and 20 mg of a mouse monoclonal lgG1 antibody. 

It is believed that the introduction of a cysteine residue into the protein and the use of a thiol-specific 
20 chemistry for immobilization have resulted in a protein product which is not modified extensively. Thus, its 
potential for binding to antibodies is much enhanced. 

For comparison, the amount of Protein G variants 2 and 3 immobilized per ml of gel using various 
immobilization chemistries is shown in Table 6. Also shown in Table 6 is the binding capacity of 
immobilized variants 2 and 3 for human IgG and mouse monoclonal IgG. As can be seen in Table 6, the 
25 type 3 variant is immobilized to a much greater extent by use of maleimide chemistry. Moreover, the type 3 
variant immobilized by maleimide chemistry has unexpectedly higher binding capacity for human IgG and 
mouse monoclonal IgG. 



Table 6 

30 









Binding Capacity Per ml of 










Gel: 


Type 


Chemistry 


Protein G 


Human IgG 


Mouse 






Immobilized Per ml 




Mono-Clonal 






of Gel 




lgG1 


2 


aldehyde 


3.33 mg 


12.4 mg 


3.3 mg 


2 


tresyl 


2.50 mg 


12.5 mg 


1.8 mg 


3 


tresyl 


2.85 mg 


16.7 mg 


5.2 mg 


3 


maleimide 


4.87 mg 


29.5 mg 


20.0 mg 



Having now fully described this invention, it will be appreciated by those skilled in the art that the same 
45 can be performed with a wide range of equivalent parameters of composition, conditions, and methods of 
preparation without departing from the spirit or scope of the invention or any embodiment thereof. 



Claims 

50 

1 . A Protein G variant. 

2. A Protein G variant as claimed in claim 1 which is immobilized on a solid phase support. 

3. A variant as claimed in claim 1 or 2 which comprises the B2 binding domain (Type 5) and has the amino 
acid sequence: 
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5 ie is 

1MDPGDASELTPPVTT 
21TKPVDAETAEKAFKQ 
eiATKTFTVTEMVTEVP 
91PLVPLTPPTPIPKDD 
121 A K K D D ft K K A E T ft G. 



£G £3 3C 

YKLVX NGKTLKGETT 
YPNDNGVDGVWTYDD 
RGDPPTEPEKPEPS1 
AKKDDTKKEDAKKPE 



70 



75 



4. A variant as claimed in claim 1 or 2 which comprises the binding domain B2 (Type 7) and has the amino 
acid sequence: 

5 1C 15 2* 25 30 

1MOPGOASELTPAVTTYKLVINGKTLKGETT 

31TKAVDAETAEKAPK0YANDNGVDGVWTYDD 

S1ATKTFTVTEMV7EVPVASKRKED, 

5. A variant as claimed in claim 1 or 2 which comprises the binding domain B1 (Type 10) and has the 
amino acid sequence: 



20 



25 



S to 15 2© £5 30 

1MDPYPLPKTDTYKLILNGKTLKGETTTEPV 
31DPPTPEKVFKQYPNDNGVDGEWTYDDPTKT 
61FTVTEKPEV I DVPRGDAPTEPEKPEAS1 PL 
S1VPLTPPTP X AKODAKKODTKKEDAKKPEPK 
121 K D D A K K A E T A G. 

6. A variant as claimed in claim 1 or 2 which comprises the hybrid binding domain Bl'/B2 (Type 11) and 
has the amino acid sequence: 

5 1G 15 K 3© 

30 1KDPYPLPKTDTYKLILNGKTLKGETTTEAV 
31DAATAEKAFKQYANDNGV0GVWTYDDATKT 
felFTVTEMVTEVPRGDAPTEPEKPEPSIPUVP 
91LTPATPIAKDDAKKPDTKKEDAKKPEAKK0 
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50 



1£1 DAKKAET AG. 



7. A variant as claimed in any of claims 1 to 5 having a single domain. 

8. A variant as claimed in any of claims 2 to 7 wherein the solid phase support is agarose. 

9. A variant as claimed in any of claims 2 to 8 which is immobilized to the solid phase support by linkage 
through a cysteine group in the Protein G variant. 

40 10. A variant as claimed in any of claims 2 to 9 wherein the solid phase support is maleimide-activated 

agarose. 

11. A variant as claimed in claim 1 or 2 which comprises the binding domains B1 and B2 (Type 3) and has 
the amino acid sequence: 

45 ■ 5 1* 15 £* £5 3C 

1 HDPYPLPKTDTYKL I LNGKTLKGETTTEAV 
21DAATAEKVFKGYANDNGV0GEWTYDDATKT 
fel FTVTEKPEVI DABELTPAVTTYKLV I NGKT 
91LKGETTTKPVDPETPEKPFK0YPNDNGVDG 



121 



VWTYODATKT. FTVTEWVTEVPR6C 



12. A variant as claimed in claim 1 or 2 which comprises the binding domains B1 and B2 (Type 8) and has 
the amino acid sequence: 



55 
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5 ie is 

1HDPYPLPKTDTYKLI 
31DAATAEKVFKQYAND 
fclFTVTEKPEVXDASEL 
91LKGETTTKAVDAETA 
V M T Y D D ft T K T F T V T E 
151KPEA6XPLVPLTPAT 
161 D A K K P E A K K S D A K K fi 



£* £5 30 

LNGKTLKGETTTEAV 
NGVDGEWTYDDATKT 
TPAVTTYKLVINGKT 
EKAFKGYANDNGVDG 
MVTEVPRGDAPTEPE 
PXAKDDAKKDDTKKE 
E T A P 6 C. 



13. A variant as claimed in claim 1 or 2 which comprises the binding sites B1 and B2 (Type 9) and has the 
10 amino acid sequence: 



75 



1 

31 
fcl 
91 
121 
151 



i© is ee 25 

PKTDTYKL ILNGKTLKGETT 
KVFKOYANDNGVDGEWTYDD 
p E V I DASELTPAVTTYKLVI 
TKAVDAE7 AEKAFKQYANDN 
ATKTFTVT EMVTEVPRGDAP 
PLVPLT6C* 



3C 

ft V 



20 14. A variant as claimed in claim 1 or 2 which comprises the binding domains B1 and B2 (Type 1) and has 
the amino acid sequence: 



5 ie 

1MOPALPKTDTYKL 
31 AATAEKVFKGYA*N 
61TVTEKPEVIDASE 
91K6ETTTKAVDAET 
121 WTYDDATKTFTVT 



IS 20 25 3C 

ILNGKTLKGETTTEAVD 
DNGVDGEUTYDDATKTF 
LT P A'VTTYKLV I NGKTL 
AEKAFKDYANDNGVDGV 
EWVTEVPVftSKRKED, 
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30 15. A variant as claimed in claim 1 or 2 which comprises the binding domains B1 and B2 (Type 2) and has 
the amino acid sequence: 

5 1ft 15 2* 25 3C 

l MDPYPLPKTDTYKL X LNGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEWTY00ATKT 
AlFTVTEKPEVXDASELTPAVTTYKLVINGKT 
91LKGETTTKAVDAETAEKAFKQYANONGVDG 
ill VWTYDOATKTFTVTEMVTEVPRGOAPTEPE 
151 KPEAS?PUVPLTPATP XAKDDAKKDDTKKE 
1810AKKPEAKKD0AKKAETAG. 

16. A variant as claimed in claim 1 or 2 which comprises the binding domains B1 and B2 (Type 4) and has 
the amino acid sequence: 



40 



1 A G 0 P X 

31 R N E E 6 

61 T T T E A 

91 D D A T K 

121 V X N 6 K 

151 D N 6 V D 

181 A P T E P 

£11 D 0 T K K 



1C 

E D T P X X 
A T A G Y P 
V D A A T A 
T F T V T E 
TLKGET 
6 V W T Y D 
E K P E A S 
E D A K K P 



15 

R N G G E L 
L P K T D T 
E K V F K 0 
K P E V X D 
T T K A V D 
D A T K T F 
X P L V P L 
E A K K D D 



T H L L G N 

Y K L X L N 

Y A N D N G 
A G E L T P 
A E T A E K 
T V T E H V 
T P A T P X 
A K K A £ T 



25 3* 

6 E T T L A L 

G K T L K 6 E 

V D G E W T Y 

A V T T Y K L 

A F K Q Y A N 

T E V P R S D 

A K D D A K K 
A G, 



17. A variant as claimed in claim 1 or 2 which comprises the binding domains B1, B1/B2, and B2 (Type 6) 
and has the amino acid sequence: 
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10 



15 



40 



45 



50 



55 



5 ie xs £" 3* 

1MDPYPLPKTDTYKLILNGKTLKGETTTEOV 

31Dfi«T«EKVFKQYflNDNGVDBEWTYDDATKT 
61FTVTEKPEVIDBSELTPP.VTTYKLVINGKT 
91LKGETTTEP.VDP.P.TP.EKVFK0YP.NDNGVDG 
J21EWTY0DPTKTFTVTEKPEVIDPSELTPOVT 

SslssSHls?SS5SSSSSiSfcSSfe5SSiS-*SS.» 



18. A method of isolating Fab or F(ab') 2 fragments from IgG and Fc fragments, the method comprising: 

(a) contacting a sample comprising Fc fragments, IgG and at least one of Fab or F(ab ) 2 fragments with 
a single domain Protein G variant immobilized on a solid phase support; and 

(b) washing the solid phase support with a buffer with a pH of from 5 to 8 to give the Fab and F(ab ) 2 
fragments in the eluate and give a solid phase support with Fc fragments and IgG bound thereon. 

19. A method of affinity chromatography of a target molecule, the method comprising: 

(a) contacting a first sample comprising a target molecule-specific IgG with a Protein G variant 
immobilized to a solid phase support to give target molecule-specific IgG irreversibly immobilized to the 
solid phase support; 

20 (b) contacting a second sample comprising a target molecule with the solid support having the target 
molecule-specific IgG immobilized thereon; 

(c) washing the solid phase support with a buffer of pH 5 to 8 to eiute unbound solute; and 

(d) eluting the target molecule with a buffer of pH 3.5 to 2.4 to give the target molecule in the eluate. 

20. A method of fractionating the subclasses of IgG, the method comprising: 

25 (a) applying a sample comprising at least two subclasses of IgG to a column containing a Protein G 
variant immobilized to a solid phase support; 

(b) eluting the column with a continuous or discontinuous gradient comprising a buffer of pH 8 to 2.4; 
and 

(c) collecting fractions of eluate containing the IgG subclasses. 
30 21 . A method of isolating IgG, the method comprising: 

(a) contacting a sample comprising IgG with a Protein G variant comprising at least one cysteine residue 
immobilized on a solid phase support through the cysteine residue; 

(b) washing the solid phase support with a buffer of pH 5 to 8 to eiute unbound solute; and 

(c) washing the solid phase support with a buffer of pH 3.5 to 2.4 to give IgG in the eluate. 
35 22 A method of purifying IgG in a sample, the method comprising: 

(a) contacting a sample comprising IgG with a Protein G variant immobilized to a solid phase support; 

(b) washing the solid phase support with a buffer of pH 5 to 8 to give the impurities in the eluate and IgG 
bound to the solid phase support; and 

(c) washing the solid phase support with a buffer of pH 3.5 to 2.4 to give IgG in the eluate. 
23 A nucleic acid sequence encoding for Protein G or a Protein G variant. 

24. A nucleic acid sequence encoding for an active binding site or a portion thereof of Protein G or A 
Protein G variant. 

25. Recombinant Protein G or a variant thereof. 

26. A vector comprising a nucleic acid sequence encoding for Protein G or a variant of Protein G, or an 
active binding site thereof. 

27. A vector as claimed in claim 26 which is lambda gt11, m13mp9, pBR322 or pGx1066. 

28. A host transfected with a vector as claimed in claim 26 or 27. 

29. A host as claimed in claim 28 which is E. coli or Bacillus subtilis . 



Claims for the following Contracting State: ES 

1. A process for the preparation of a Protein G variant, the process comprising coupling successive amino 

acids together. , , 

2. A process as claimed in claim 1 additionally comprising immobilizing the variant on a solid phase 

support. _ x . 

3. A process as claimed in claim 1 or 2 wherein the variant comprises the B2 binding domain (Type 5) and 

has the amino acid sequence: 
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5 1© 15 as £5 30 

1HDPGDAGELTPAVTTYKLVIN6KTLKGETT 
->1TKAVDAETAEKAFKQYANDNGVDGVWTYDD 
61ATKTFTVTEMVTEVPRGDAPTEPEKPEAS1 
91 PLVPLTPATP I AKDDAKKDDTKKEDAKKPE 
lSiAKKODAKKAETAG- 



10 



75 



4. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domain- B2 (Type 7) and 
has the amino acid sequence: 

5 1C 15 25 30 

1MDPGDASELTPBVTTYKLVI NGKTLKGETT 
31 TKAVDAETAEKAP K0YANDN6VDGVWTYDD 
61ATKTFTVTEMVTEVPVASKRKED, 

5. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domain B1 (Type 10) and 
has the amino acid sequence: 



20 5 13 15 2C £5 3B 

1MDPYPLPKTDTYKL X LNGKTLKGETTTEAV 
31DAATAEKVFK0YANDNGVDGEWTYDDATKT 
G1FTVTEKPEVIDVPRGDAPTEPEKPEAS1PL 
91 VPLTPATP IAKDDAKKDDTKKEOAKKPEAK 
lcl KDDAKKAETA-G. 

25 

6. A process as claimed in claim 1 or 2 wherein the variant comprises the hybrid binding domain Bl'/B2 
(Type 11) and has the amino acid sequence: 

5 ie is ee es 38 

30 1HDPYPLPKTDTYKL 2 LNGKTLK6ETTTEAV 

31DAATAEKAFKQYANDNGVDGVWTYDDATKT 
fclFTVTEWVTEVPRGOAPTEPEKPEASlPLVP 
91LTPATPXAKDDAKKDDTKKEDAKKPEAKKD 
lcl 0 A K K A E T A G. 

35 

7. A process as claimed in any of claims 1 to 5 wherein the variant has a single domain. 

8. A process as claimed in any of claims 2 to 7 wherein the solid phase support is agarose. 

9. A process as claimed in any of claims 2 to 8 wherein the Protein G is immobilized to the solid phase 
support by linkage through a cysteine group in the Protein G variant. 

40 10. A process as claimed in any of claims 2 to 9 wherein the solid phase support is maleimide-activated 
agarose. 

11. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1 and B2 
(Type 3) and has the amino acid sequence: 



45 



50 



• s le is ee es 3b 

1KOPYPLPKTDTYKL X LNGKTLKGETTTEAV 
31 DAATAEKVFKQYANDNGVOGEWTYDDATKT 
61FTVTEKPEVIOABELTPAVTTYKLV2NGKT 
91LKGETTTKAVDAETAEKAFKGYAN0N6VDG 
IfclVUTYDDATKT. FTVTEHVT'EVPRGC* 

12. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1 and B2 
(Type 8) and has the amino acid sequence: 



55 
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K T 


61 


F 


T 


V 


T 


E 
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V I 


D 


A 


S 


E L 


T 


P 
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T 


Y 


KLV 


I 


N 
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K T 


91 


L 


K 


G 
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T 


T 


T 


K 


A V 


D 


A 


E 


T A 


E 


K 


A 


F K 


Q 


Y 


AND 


N 
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V 


D G 


121 
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13. A process as claimed in claim 1 or 2 wherein the variant comprises the binding sites B1 and B2 (Type 
to 9) and has the amino acid sequence: 

5 1C 15 8* ES 3C 

i wdpyplpktdtyklilngktlkgeYtteav 
31 daataekvfkgyandngvdgewtyddatkt 

75 fciFTVTEKPEVlOAEELTPAVTTYKLVINGKT 
91LKGETTTKAVDAE1AEKAFKGYANDNGV0G 
121VWTYDDATKTFTVTEMVTEVPRGDAPTEPE 
151 K P E fl B I P L V P L T E C, 

20 14. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1 and B2 
(Type 1 ) and has the amino acid sequence: 

5 10 15 20 25 30 

1 MDPALPKTDTYKL X LNGKTLKGETTTEAVD 
25 31 AATAEKVFKQYAtJDNG V-D GEWTYDDATKTF 

61TVTEKPEVI DASELTP A'V TTYKLVINGKTL 
91K6ETTTKAVDAETAEKAFKQYAN0NGVDGV 
121UTYDDATKTFTVTEWVTEVPVASKRKED. 

30 15. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1 and B2 
(Type 2) and has the amino acid sequence: 

5 ie IS 20 25 30 

1 MOPYPLPK'TOTYKL 1 LNGKTLKGETTTEAV 
31DAATAEKVFKQYANDNGVDGEWTYD0ATKT 
35 &1FTVTEKPEU2DASE L T PAVTTYKLVI NGKT 

S1LKGETTTKAVDAETAEKAFKQYANDNGVDG 
121VWTYD0ATKTFTVTEHVTEVPRG0APTEPE 
151 KPEA6XPLVPLTPATPXAK0DAKK00TKKE 
181DAKKPEAKKDDAKKAETAG. 

40 

16. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1 and B2 
(Type 4) and has the amino acid sequence: 



9 10 15 M W 30 

GDPIEDTPX IRNGGELTNLL6NBETTLAL 
^^NEEBATAGYPLPKTDTYKLXLNGKTLKGE 
ei?TTEAV0AATAEKVFKQYANDNGVD6EWTY 
910OATKTFTVTEKPEVIDASELTPAVTTYKL 
21VIN6KTLKGETTTKAVDAETAEKAFK0YAN 



45 1 A 

31 RNEE6AT 



DN6 D6VMTYDDATKTFTVTEMVTEVPRSD 
50 IBl APTEPEKPEAS I PLVPLTPATPXAKDDAKK 

211 DOTKKEDAKKPEAKKDDAKKAETAG. 

17. A process as claimed in claim 1 or 2 wherein the variant comprises the binding domains B1, B1/B2, and 
B2 (Type 6) and has the amino acid sequence: 

55 
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1MDPYPLP 
31 D A T fi E K 
eiFTVTEKP 
91 LKGETTT 
1£1 E W T Y D D A 
151 T Y K L V I N 
181 Q Y P N D N G 
£11 P R G D A P T 
£A1 D A K K D t> T 



5 



ie is £5 3? 

KTDTYKLILNGKTLKGETTTEOV 
V F K Q Y PNDNGVDGEWTYDDPTKT 
EV 2 DASELTPAVTTYKLVI NGKT 
EAVDAATAEKVFKDYANDNGVDG 
TKTFTVTEKPEVI DASELTPGVT 




18. A method of isolating Fab or F(ab') 2 fragments from IgG and Fc fragments, the method comprising: 

(a) contacting a sample comprising Fc fragments, IgG and at least one of Fab or F(ab ) 2 fragments with 
a single domain Protein G variant immobilized on a solid phase support; and t 

(b) washing the solid phase support with a buffer with a pH of from 5 to 8 to give the Fab and F(ab ) 2 
fragments in the eluate and give a solid phase support with Fc fragments and IgG bound thereon. 

19. A method of affinity chromatography of a target molecule, the method comprising: 

(a) contacting a first sample comprising a target molecule-specific IgG with a Protein G variant 
immobilized to a solid phase support to give target molecule-specific IgG irreversibly immobilized to the 
solid phase support; 

(b) contacting a second sample comprising a target molecule with the solid support having the target 
molecule-specific IgG immobilized thereon; 

(c) washing the solid phase support with a buffer of pH 5 to 8 to elute unbound solute; and 

(d) eluting the target molecule with a buffer of pH 3.5 to 2.4 to give the target molecule in the eluate. 

20. A method of fractionating the subclasses of IgG, the method comprising: 

(a) applying a sample comprising at least two subclasses of IgG to a column containing a Protein G 
variant immobilized to a solid phase support; 

(b) eluting the column with a continuous or discontinuous gradient comprising a buffer of pH 8 to 2.4; 



(c) collecting fractions of eluate containing the IgG subclasses. 

21. A method of isolating IgG, the method comprising: 

(a) contacting a sample comprising IgG with a Protein G variant comprising at least one cysteine residue 
immobilized on a solid phase support through the cysteine residue; 

(b) washing the solid phase support with a buffer of pH 5 to 8 to elute unbound solute; and 

(c) washing the solid phase support with a buffer of pH 3.5 to 2.4 to give IgG in the eluate. 

22. A method of purifying IgG in a sample, the method comprising: 

(a) contacting a sample comprising IgG with a Protein G variant immobilized to a solid phase support; 

(b) washing the solid phase support with a buffer of pH 5 to 8 to give the impurities in the eluate and IgG 
bound to the solid phase support; and 

(c) washing the solid phase support with a buffer of pH 3.5 to 2.4 to give IgG in the eluate. 

23. A process for the preparation of a nucleic acid sequence encoding for Protein G or a Protein G variant, 
the process comprising coupling successive nucleotides together. 

24. A process for the preparation of a nucleic acid sequence encoding for an active binding site or a portion 
thereof of Protein G or A Protein G variant, the process comprising coupling successive nucleotides 



25. A process for the preparation of recombinant Portein G or a variant thereof, the process comprising 
coupling successive nucleotides together. 

26. A process for the preparation of a vector the process comprising incorporating a nucleic acid sequence 
encoding for Protein G or a variant of Protein G, or an active binding site thereof, into nucleic acid of a 
vector. 

27. A process as claimed in claim 26 wherein the vector is lambda gt11, m13mp9, pBR322 or pGx1066. 

28. A process comprising transfecting a host with a vector as claimed in claim 26 or 27. 

29. A process as claimed in claim 28 wherein the host is E. coli or Bacillus subtilis . 



and 



together. 



44 



EP 0 408 178 A1 




If " ' ■ ■ ■ ■ I l ff I 

h l-9kbp H 

PARTIAL RESTRICTION MAP OF PLASMID pGX4533 



Fig. 2 



45 



EP 0 408 178 A1 



AAGCTTT6GTGGAGAAATTGGCTGGCGAATCCAGCTTCACCGGTGTTTCA 50 

CCAGTAGATGCTTTCTGTGGTCTTATTGACACGCACTTGTGGCGAGAGTA 100 

CTAACAGTCACAGCGACGTTAACTTTATTTTCCTTATGAGAGGTTAAGAA 150 

AAAACGTTATTAAATAGCAGAAAAGAATATTATGACTGACGTTAGGAGTT 200 

TTCTCCTAACGTTTTTTTTAGTACAAAAAGAGAATTCTCTATTATAAATA 250 

AAATAAATAGTACTATAGATAGAAAATCTCATTTTTAAAAAGTCTTGTTT 300 

TCTTAAAGAAGAAAATAATTGTTGAAAAATTATAGAAAATCATTTTTATA 350 

CTAATGAAATAGACATAAGGCTAAATTGGTGAGGTGATGATAGGAGATTT 400 

ATTTGTAAGGATTCCTTAATTTTATTAATTCAACAAAAATTGATAGAAAA 450 

ATTAAATGGAATCC TTGATT TAATTTTATTAAGTTG TATAAT AAAAAGTG 500 

-35 -10 

AAATTATTAAATCGTAGTTTCAAATTTGTCGGCTTTTTAATATGTGCTGG 550 

MET GLU LYS GLU LYS 
CATATTAAAATTAA AAAAGGAG AAAAA ATG GAA AAA GAA AAA 592 

rbs 
10 

LYS VAL LYS TYR PHE LEU ARG LYS SER ALA PHE GLY LEJ 

AAG GTA AAA TAC TTT TTA CGT AAA TCA GCT TTT GGG TTA 631 

20 30 
ALA SER VAL SER ALA ALA PHE LEU VAL GLY SER THR VAL 
GCA TCC GTA TCA GCT GCA TTT TTA GTG GGA TCA ACG GTA 670 

40 

PHE ALA VAL ASP SER PRO ILE GLU ASP THR PRO ILE ILE 

TTC GCT GTT GAT TCA CCA ATC GAA GAT ACC CCA ATT ATT 709 

50 

ARG ASN GLY GLY GLU LEU THR ASN LEU LEU GLY ASN SER 

CGT AAT GGT GGT GAA TTA ACT AAT CTT CT6 GGG AAT TCA 748 

60 70 
GLU THR THR LEU ALA LEU ARG ASN GLU GLU SER ALA THR 
GAG ACA ACA CTG GCT TTG CGT AAT GAA GAG AGT GCT ACA 787 



FIG. 3 
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60 

LEU THR ALA ALA ALA VAL ALA ASP THR VAL ALA 

TTG ACA GCA GCA GCG GTA GCC GAT ACT GTG GCA 826 

90 

ALA ALA GLU ASN ALA GLY ALA ALA ALA TRP GLU 

GCA GCT GAA AAT GCT GGG GCA GCA GCT TGG GAA 865 

100 

ALA ALA ALA ASP ALA LEU ALA LYS ALA LYS ALA 

GCA GCA GCA GAT GCT CTA GCA AAA GCC AAA GCA 904 

120 

LEU LYS GLU PHE ASN LYS TYR GLY VAL SER ASP 

CTT AAA GAA TTC AAC AAA TAT GGA GTA AGT GAC B43 

130 

LYS ASN LEU ILE ASN ASN ALA LYS THR VAL GLU 

AAG AAT CTA ATC AAC AAT GCC AAA ACT GTT GAA 982 

140 

LYS ASP LEU GLN ALA GLN VAL VAL GLU SER ALA 

AAA GAC CTT CAA GCA CAA GTT GTT GAA TCA GCG 1021 

160 

ALA ARG ILE SER GLU ALA THR ASP GLY LEU SER 

GCG CGT ATT TCA GAA GCA ACA GAT GGC TTA TCT 1060 

170 

LEU LYS SER GLN THR PRO ALA GLU ASP THR VAL 

TTG AAA TCG CAA ACA CCT GCT GAA GAT ACT GTT 1099 

180 

ILE GLU LEU ALA GLU ALA LYS VAL LEU ALA ASN 

ATT GAA TTA GCT GAA GCT AAA GTC TTA GCT AAC 1138 

190 200 

LEU ASP LYS TYR GLY VAL SER ASP TYR HIS LYS 

CTT GAC AAA TAT GGA GTA AGT GAC TAT CAC AAG 1177 

210 

ILE ASN ASN ALA LYS THR VAL GLU GLY VAL LYS 

ATC AAC AAT GCC AAA ACT GTT GAA GGT GTA AAA 1216 

220 

ILE ASP GLU ILE LEU ALA ALA LEU PRO LYS THR 

ATA GAT GAA ATT TTA GCT GCA TTA CCT AAG ACT 1255 

230 

TYR LYS LEU ILE LEU ASN GLY LYS THR LEU LYS 

TAC AAA TTA ATC CTT AAT GGT AAA ACA TTG AAA 1294 
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280 

THR VAL 
ACA GTT 



LEU THR 
TTA ACA 
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ASP ALA 
GAC GCA 
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AAC GAC 



ALA THR 
GCG ACT 



VAL PRO 

GTT CCT 

370 

ALA SER 

GCA AGT 



ILE ALA 

ATT GCT 



GLU ASP 
GAA GAT 



2S0 

THR THR THR GLU ALA VAL ASP ALA ALA THR ALA 

ACA ACT ACT GAA GCT GTT GAT GCT GCT ACT GCA 1333 

260 

VAL PHE LYS GLN TYR ALA ASN ASP ASN GLY VAL 

GTC TTC AAA CAA TAC GCT AAC GAC AAC GGT GTT 1372 

270 

GLU TRP THR TYR ASP ASP ALA THR LYS THR PHE 

GAA TGG ACT TAC GAC GAT GCG ACT AAG ACC TTT 1411 

290 

THR GLU LYS PRO GLU VAL ILE ASP ALA SER GLU 

ACT GAA AAA CCA GAA GTG ATC GAT GCG TCT GAA 1450 

300 

PRO ALA VAL THR THR TYR LYS LEU VAL ILE ASN 

CCA GCC GTG ACA ACT TAC AAA CTT GTT ATT AAT 1489 

310 

THR LEU LYS GLY GLU THR THR THR LYS ALA VAL 

ACA TTG AAA GGC GAA ACA ACT ACT AAA GCA GTA 1528 

320 330 

GLU THR ALA GLU LYS ALA PHE LYS GLN TYR ALA 

GAA ACT GCA GAA AAA GCC TTC AAA CAA TAC GCT 1567 

340 

ASN GLY VAL ASP GLY VAL TRP THR TYR ASP ASP 

AAC GGT GTT GAT GGT GTT TGG ACT TAT GAT GAT 1606 

350 

LYS THR PHE THR VAL THR GLU MET VAL THR GLU 

AAG ACC TTT ACG GTA ACT GAA ATG GTT ACA GAG 1645 

360 

GLY ASP ALA PRO THR GLU PRO GLU LYS PRO GLU 

GGT GAT GCA CCA ACT GAA CCA GAA AAA CCA GAA 1684 

380 

ILE PRO LEU VAL PRO LEU THR PRO ALA THR PRO 

ATC CCT CTT GTT CCG TTA ACT CCT GCA ACT CCA 1723 

390 

LYS ASP ASP ALA LYS LYS ASP ASP THR LYS LYS 

AAA GAT GAC GCT AAG AAA GAC GAT ACT AAG AAA 1762 

400 

ALA LYS LYS PRO GLU ALA LYS LYS ASP ASP ALA 

GCT AAA AAA CCA GAA GCT AAG AAA GAT GAC GCT 1801 
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AAGCTTT6GTGGAGAAATTGGCTGGCGAATCCAGCTTCACCGGTGTTTCA 
CCAGTAGATGCTTTCTGTGGTCTTATTGACACGCACTTGTGGCGAGAGTA 
CTAACAGTCACAGCGACGTTAACTTTATTTTCCTTATGAGAG6TTAAGAA 
A A A ACGTT ATT A AAT AGC AGA AA AG AAT ATT ATG ACTG ACGTT AGG AGTT 
TTCTCCTAACGTTTTTTTTAGTACAAAAAGAGAATTCTCTATTATAAATA 
AAATAAATAGTACTATAGATAGAAAATCTCATTTTTAAAAAGTCTTGTTT 
TCTTAAAG A A G A A A A T A AT T G TT G A A A A ATT AT A G A A A AT C ATTTT T A T A 
CTAATGAAATAGACATAAGGCTAAATTGGTGAGGTGATGATAGGAGATTT 
ATTTGTAAGGATTCCTTAATTTTATTAATTCAACAAAAATTGATAGAAAA 
ATTAAATGGAATCCTTGATTTAATTTTATTAAGTTGTATAATAAAAAGTG 



AAATTATTAAATCGTAGTTTCAAATTTGTCGGCTTTTTAATATGTGCTGG 

MET GLU LYS GLU LYS 

CATATTAAAATTAA AAAAGGAG AAAAA ATG GAA AAA GAA AAA 

rbs 

10 

LYS VAL LYS TYR PHE LEU ARG LYS SER ALA PHE GLY LEU 

AAG GTA AAA TAC TTT TTA CGT AAA TCA GCT TTT GGG TTA 

20 30 

ALA SER VAL SER ALA ALA PHE LEU VAL GLY SER THR VAL 

GCA TCC GTA TCA GCT GCA TTT TTA GTG GGA TCA ACG GTA 

40 

PHE ALA VAL ASP SER PRO ILE GLU ASP THR PRO ILE ILE 

TTC GCT GTT GAT TCA CCA ATC GAA GAT ACC CCA ATT ATT 

50 

ARG ASN GLY GLY GLU LEU THR ASN LEU LEU GLY ASN SER 

CGT AAT GGT GGT GAA TTA ACT AAT CTT CTG GGG AAT TCA 

60 70 

GLU THR THR LEU ALA LEU ARG ASN GLU GLU SER ALA THR 

GAG ACA ACA CTG GCT TTG CGT AAT GAA GAG AGT GCT ACA 
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ALA ALA 
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110 
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GAT GCC 
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GLY ILE 
GGC ATA 

150 
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AAG AAA 



ASP PHE 
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AAC CTA 



GLU LEU 
GAA CTG 



ASP THR 
GAC ACT 



80 

LEU THR ALA ALA ALA VAL ALA ASP THR VAL ALA 

TTG ACA GCA GCA GCG GTA GCC GAT ACT GTG GCA 826 

90 

ALA ALA GLU ASN ALA GLY ALA ALA ALA TRP GLU 

GCA GCT GAA AAT GCT GGG GCA GCA GCT TGG GAA 665 

100 

ALA ALA ALA ASP ALA LEU ALA LYS ALA LYS ALA 

GCA GCA GCA GAT GCT CTA GCA AAA GCC AAA GCA 904 

120 

LEU LYS GLU PHE ASN LYS TYR GLY VAL SER ASP 

CTT AAA GAA TTC AAC AAA TAT GGA GTA AGT GAC 943 

130 

LYS ASN LEU ILE ASN ASN ALA LYS THR VAL GLU 

AAG AAT CTA ATC AAC AAT GCC AAA ACT GTT GAA 982 

140 

LYS ASP LEU GLN ALA GLN VAL VAL GLU SER ALA 

AAA GAC CTT CAA GCA CAA GTT GTT GAA TCA GCG 1021 

160 

ALA ARG ILE SER GLU ALA THR ASP GLY LEU SER 

GCG CGT ATT TCA GAA GCA ACA GAT GGC TTA TCT 1060 

170 

LEU LYS SER GLN THR PRO ALA GLU ASP THR VAL 

TTG AAA TCG CAA ACA CCT GCT GAA GAT ACT GTT 1099 

180 

ILE GLU LEU ALA GLU ALA LYS VAL LEU ALA ASN 

ATT GAA TTA GCT GAA GCT AAA GTC TTA GCT AAC 1138 

190 200 

LEU ASP LYS TYR GLY VAL SER ASP TYR HIS LYS 

CTT GAC AAA TAT GGA GTA AGT GAC TAT CAC AAG 1177 

210 

ILE ASN ASN ALA LYS THR VAL GLU GLY VAL LYS 

ATC AAC AAT GCC AAA ACT GTT GAA GGT GTA AAA 1216 

220 

ILE ASP GLU ILE LEU ALA ALA LEU PRO LYS THR 

ATA GAT GAA ATT TTA GCT GCA TTA CCT AAG ACT 1255 

230 

TYR LYS LEU ILE LEU ASN GLY LYS THR LEU LYS 

TAC AAA TTA ATC CTT AAT GGT AAA ACA TTG AAA 1294 
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